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ABSTRACT 


Stochastic Marked Graphs are a concurrent decision free formalism provided with a 
powerful synchronization mechanism generalizing conventional Fork Join Queueing 
Networks. In some particular cases the analysis of the throughput can be done an- 
alytically. Otherwise the analysis suffers from the classical state explosion problem. 

Embedded in the divide and conquer paradigm, this thesis introduces approx- 
imation techniques for the analysis of stochastic marked graphs and Macroplace/- 
Macrotransition-nets (MPMT-nets), a new subclass which is introduced in this the- 
sis. MPMT-nets are a subclass of Petri nets that allow limited choice, concurrency 
and sharing of resources. The modeling power of MPMT is much larger than that of 
marked graphs, e.g., MPMT-nets can model manufacturing flow lines with unreliable 
machines and dataflow graphs where choice and synchronization occur. 

The basic idea leads to the notion of a cut to split the original net system into 
two subnets. The cuts lead to two aggregated net systems where one of the subnets 
is reduced to a single transition. A further reduction leads to a basic skeleton. The 
generalization of the idea leads to multiple cuts, where single cuts can be applied 
recursively leading to a hierarchical decomposition. 

Based on the decomposition, a response time approximation technique for the 
performance analysis is introduced. Also, delay equivalence, which has previously 
been mtroduced in the context of marked graphs by Woodside et al., Marie’s method 
and flow equivalent aggregation are applied to the aggregated net systems. 

The experimental results show that response time approximation converges 
quickly and shows reasonable accuracy in most cases. The convergence of Marie’s 
is slower, but the accuracy is generally better. Delay equivalence often fails to 
converge, while flow equivalent aggregation can lead to potentially bad results if a 
strong dependence of the mean completion time on the interarrival process exists. 

viii 



1.1 Motivation 


CHAPTER 1 

Introduction 


Manufacturing systems, computer systems and robotics systems cannot be de- 
scribed by differential or difference equations. System changes happen in random 
intervals of time, and are therefore stochastic in their nature. These systems are 
commonly referred to as discrete event dynamic systems (DEDS). Modeling, control 
and the analysis of DEDS represents a big challenge. Robotics systems composed 
of multiple arms, a vision system, force/torque sensor and control computer for 
example, are discrete event in their operation and have a rather large number of 
interacting components. It is the interaction of the subsystems coupled with the 
discrete nature of their dynamics which makes the modeling and performance anal- 
ysis challenging [AJD90a], 

In the modeling and analysis of DEDS there is a trade-off between the ability of 
how realistically our model is able to represent the actual system (modeling power), 
the cost and effort involved in the modeling process (economical considerations) and 
the analytical tractability , i.e., the ease of analysis of our model. Ideally we would 
like to have a single model, which can express the following characteristics m one 

representation. 

• Logical properties 

1. Asynchronous operations. The model should be concerned with the in- 
dependent “internal clocks”. 

2. Concurrency and parallelism. Several activities can take place at the 
same time. 
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3. Conflict. This may occur when two or more processes require the same 
resource or a part can take several paths through the system. 

4. Synchronic distance. How often can transition A fire before transition B 
has to fire. 

5. Deadlock. A state can be reached where none of the processes can con- 
tinue. For example this can happen with the sharing of a resource be- 
tween two processes, e.g., two robots want to acquire a vision system 
simultaneously. 

6. Event driven. Since operations can occur concurrently, the order of events 
is not necessarily unique. 

- • Temporal properties such as the duration of activities and timeout. 

• Operational properties such as real time scheduling using mathematical or 
heuristic models and high level decision making. 

Figure 1.1 shows the hierarchy of the modeling process. As we move to the top 
of the pyramid, we can increasingly model the desired characteristics of the system. 

At the bottom of the modeling hierarchy are product form queuing networks 
(PFQNs) which are computationally very easy to solve, but this computational sav- 
ing has a drawback: classical PFQNs (Jackson QNs, BCMP QNs) cannot model 
blocking, finite buffer size, synchronization and control over the dispatching of cus- 
tomers. Therefore, their modeling power is low. Because of their computational 
ease, PFQNs are most useful for the initial rough design and analysis of a system. 

Petri nets (PNs) allow the modeling of the logical properties of the system. 
Depending on the net subclass, polynomial time algorithms exist for the analysis of 
liveness, boundedness and deadlock. The incorporation of time leads to generalized 
stochastic Petri nets (GSPNs) where transitions can fire in zero time (immediate 
transitions) or after an exponentially distributed amount of time (timed transitions). 
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Figure 1.1: Hierarchical Representation of the Modeling Process 


The underlying assumption is that the system is memoryless (Markovian), i.e., the 
next state depends only on the current state and not on the history of the previous 
states which were visited, or how long the system has been in the current state. The 
only continuous probability distribution which satisfies the Markovian property is 
the exponential distribution. 

GSPNs can be interpreted as a generalization of traditional queuing net- 
- works (QNs) allowing synchronization, fork, split, choice and timeout constructs 
(see [VZL87] for a comparison of the modeling power between QNs and GSPNs). 
In order to determine the performance measures, the underlying continuous time 
Markov chain (CTMC) is constructed from the reachability graph and the corre- 
sponding system of linear equations is solved. 

GSPNs can model a system with considerable level of detail. The major draw- 
back for the use of GSPNs for the performance analysis is the problem of state 
space explosion. Even a moderately sized system can have thousands of states, and 
the solution of the CTMC becomes increasingly space and time consuming. Using 
more powerful computers or more efficient algorithms would help, but state space 
explosion is exponential , while faster algorithms [JGD9I] or increased memory are 
a polynomial improvement at best. Ultimately, it is the problem of state explo- 
sion which renders the analysis of complex systems using GSPNs difficult. GSPNs 
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have been used for the analysis of computer systems [Mol82, Beo85], flexible man- 
ufacturing systems [AJ88, KDD88, BCFR87, CCS90] and manufacturing message 
specifications [WJG91]. GSPNs are most useful in the intermediate design state 
where various options are being explored, e.g., different buffer sizes, machining or 
processor speeds can easily be explored. Once a PN model has been found, it can 
also be used as the underlying structure for the simulation model [HS89, BC90]. 

Further up in the hierarchy is simulation, one of the most widely used tech- 
niques in operations research. Simulation allows an “as good as possible” approach 
to the modeling of systems, i.e., almost every imaginable detail of the system in the 
model can be captured in the simulation, e.g., the probability distributions for the 
processes can be arbitrary. Models of large scale systems tend to be very complex, 
so the coding and validation of the model can be very arduous. 

The drawback for the high modeling power is the lack of any kind of method- 
ological analysis. Furthermore, in order to obtain small confidence intervals, the 
simulation runs tend to be very long and expensive. The results of a simulation 
run have to be analyzed with caution, because we do not get exact performance 
measures, but only a (hopefully) close estimate. Simulation enjoys a high credibility 
and is most useful for the final stage of the system design for the fine tuning of the 
system and the evaluation of different scheduling policies, etc. 

Often we are interested in finding the optimal allocation of resources. This 
involves analyzing the sensitivity of the system with respect to the change of the 
parameters we are interested in. In order to find the gradient at an operating point 
we usually use a finite difference approximation. Therefore, we require two runs: 
one with the nominal set of parameters, and a second one where the parameter of 
interest is slightly perturbed. On one hand we want to make the perturbation as 
small as possible in order to get the true gradient. But this involves long run times 
in order to get the small confidence intervals which are necessary. On the other 
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hand, we want a large increment in the parameter in order to have short run times, 

but in general this will not give us the true gradient. 

The efficiency of simulation can be increased by perturbation analysis where it 
is possible to find the gradient with respect to the change of parameters based on 
one long simulation run [HC79, HC83, Sur89]. The parameters can be continuous 

(e.g., processor speed) or discrete (e.g., buffer size). 

At the top of the hierarchy is building a prototype of the system which is the 
most "reliable” way of analyzing the performance of a system, but also the most 
expensive. Prototyping is done once the system has been designed and no further 
changes are expected. 

1.2 Goal of this Research 

The major problem for the use of GSPNs is the problem of state space ex- 
plosion. The goad of this research is to move GSPNs up in the modeling hierarchy 
shown in Figure 1.1, so we can use Petri nets for models which previously could only 
be analyzed by simulation or heuristics due to the large state space. 

In the study of QNs, efficient algorithms have been developed for the approxi- 
mate analysis of QNs whose topology (or service rules) locally violates the product 
form assumption [LZGS84, Mar79, ABS84]. These QNs are called * almost ” PFQNs. 
The approximate analysis is done in two steps: 

1. In the structural decomposition , the QN is partitioned in such a way that the 
topology and routing probabilities can be interpreted as being from a PFQN. 

2. During the approximate analysis, the system is analyzed either iteratively 
(Marie’s method, [Mar79], response time preservation [ABS84]), or in a single 
step (flow equivalent aggregation [LZGS84]). 

In this work, we study the problem of reducing the underlying state space of 


iAUTY 



6 


the CTMC in order to allow the analysis of large systems. 

The basic approach is to partition the GSPN by a single cut and substitute 
the subnets by simpler constructs resulting in aggregated net systems, thus reducing 
the state space. At the same time, the logical properties of the unaggregated parts 
of the net system are preserved. 

We combine features from queueing theory and the results from the struc- 
tural analysis of Petri nets. We start out from marked graphs (MGs), a concurrent 
decision free formalism provided with a powerful synchronization mechanism gen- 
eralizing conventional QNs. For the structural decomposition of MGs, we make 
extensive use of the structural theory of MGs. In QNs, the characterization of the 
workload of PFQNs (or ” almost ” PFQNs) is usually not a problem, while in MGs 
this step is not obvious. Here we define the workload of a server as the maximum 
number of customers in its input queue. In Petri nets, places represent queues while 
timed transitions represent servers. Furthermore, in the decomposition phase, we 
want to preserve some marking properties of the aggregated systems. 

Once the MG is partitioned, any of the available approximate techniques for 
QNs (Marie’s method, FEA) or delay equivalence [LW91, WL91] can be used. The 
latter is an iterative technique and was first introduced in the context of MGs. In 
addition, a new iterative technique, response time approximation (RTA), specifically 
designed for the case of a single cut is introduced, which shows good convergence 
properties. The four methods are discussed and the computational effort and accu- 
racy discussed. 

MGs are then generalized to macroplace/macrotransition nets (MPMT-nets), 
a new subclass of Petri nets based on the combination of the basic structures of 
state machines (SMs) and MGs. The practical modeling with this new subclass of 
net systems has some advantages in the field of manufacturing with respect to the 
use of the well known subclass of live and bounded free choice net systems, allowing 
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timeout and the limited sharing of resources. 

The modeling power of MPMT nets is much higher than that of state machines 
(SMs) or MGs alone. RTA can therefore be seen as a more general approximation 

technique. 

The work is organized as follows: In Chapter 2 we briefly review some of 
the pertinent literature of GSPNs and QNs. In Chapter 3 we establish the Petri 
net notation and review some of the important properties of PNs. In Chapter 4 
we present the structural decomposition of MGs and a new iterative approximation 
technique (RTA) as a hierarchical decomposition approach Petri nets based on RTA. 
In Chapter 5 we present other approximation technique for QNs in the context of 
Petri nets and a recently introduced approximation technique for MGs. In Chapter 
6, we present numerical results for MGs. In Chapter 7, a new subclass (MPMT-nets) 
of Petri nets is introduced, and several numerical examples are given m Chapter 8. 
Chapter 9 introduces SIMO cuts for MPMT-nets. In Chapter 10, a dual arm robotics 
testbed is examined. Conclusions are given in Chapter 11. 



2.1 Introduction 


CHAPTER 2 
Literature Review 


This chapter discusses the pertinent literature of QNs and GSPNs. First in 
Section 2.2 we give a brief review about Jackson and Gordon-Newell QNs. In Section 
2.3 four general approximation techniques for QNs are briefly reviewed. 

We then concentrate on efforts to contain the problem of state explosion in 
GSPNs. A very efficient computational approach, where bounds of the through- 
put can be found in polynomial time based on the net description, is presented in 
Section 2.4.1. In Section 2.4.2 the efficient construction of the underlying CTMC 
is examined. In Section 2.4.3 an iterative solution method for weakly interacting 
subnets is discussed. Finally in Section 2.5 results of an approximation method 
based on the combination of QNs and GSPNs is reviewed. The iterative method 
for the approximate solution of stochastic marked graphs introduced by Woodside 
[LW91, WL91] et al. as well as Marie’s method will be discussed in detail later in 
this work once the structural properties of the decomposition have been defined. 

2.2 Jackson Queuing Networks 

Jackson [Jac57, Jac63] considered an arbitrary network of queues with N 
nodes, each consisting of m, exponential servers with rate /*,. The state of the 
system is denoted by the vector h = (* l( k 2 . . . k N ), where is the number of cus- 
tomers in the I'-th service center i and I< = ZfLi is the total number of customers 
m the system. The outside arrival of customers to the system is modeled by a Pois- 
son process. An example of an open Jackson queueing network is shown in Figure 
2 . 1 . 
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Let A be the outside arrival rate of customers to the QN. The routing of 
customers through the QN is modeled as a random walk through the service centers. 
The customers leave the QN with probability one after a finite time. The routing is 
described by the matrix R = r(t, j), i — 0 . . . N, j 1 . . . N + 1. 

j»(0, j) is the probability that center j will be first on a routing, 

r(i,N + 1) is the probability that if center i is I th on a routing then it is the 
last one, 

r(i,j) is the probability that if center i is I th on a routing, then there is an 
(l + l) 4 * element and it is center j. 

The matrix R defines the random walk of the customer through the system. 
For the sake of simplicity, let mi = 1, * = 1 ...N. The traffic flow equations (flow 
into a service center = flow out of a service center) are given by: 

7i = \r(0,l) + 'tr(j,l)T j (2-1) 

i= i 

where T; is the total throughput of costumers at service center l. Jackson proved 
that the steady state solution p(k) is given by: 

p(ki,k 2 ,...k 1 w) = pi(ki)p2(k 2 ) ■ ■ • PNikx) (2.2) 

where p,(fc<) is the solution to the classical M/M/m queueing system, i.e., the prob- 
ability of finding k x costumers in the queue at service station i is. 

Pi(ki ) = Cip? ( 2 - 3 ) 

Ci is a normalizing constant chosen such that £«o P*( a ) = P' ~ Til Pi- K r(i, JV + 
1) = 0 and A = 0 no outside arrivals are permitted and the number of customers 
in the system is constant. Such a queuing network is known as a Gordon-Newell 
queuing network [GN67]. The structural equivalent of Gordon-Newell QNs in Petri 
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Figure 2.1: Example of Jackson Queuing Network 

nets are state machines. The class of PFQNs was extended by the work of Baskett, 
Chandy, Muntz and Palacios [BCMP75], The BCMP QN allows several classes 
of customers in the QN and different scheduling disciplines. The BCMP model 

has been expanded to include batch arrival and batch service in some special cases 
[HPTD90, HT90]. 

As soon as synchronization, limited buffer size, or the sharing of resources is 
present, a product form no longer exists. If only a “few” service centers violate the 
product form assumption, we loosely say the QN has "almost ” product form. 


2.3 Aggregation in Queueing Networks 

All of the methods presented in the following are concerned with finding the 
equivalent service rate of the subnetworks to be aggregated. The service rate can 
be state dependent (i.e., dependent on the number of customers in the subnetwork) 
or state independent. See Table 2.1 for a comparison. 

The basic approach is to set the conditional throughput (obtained under a 
controlled environment) to the service rate of the subnetwork, which in turn is 
substituted back in the original system. 
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Method 

Arrival Process 

Service Rate 

Solution Method 

Flow Equivalent 
Aggregation 

short circuit, fixed 
number of customers 

state 

dependent 

single step 

Response Time 
Preservation 

state independent 
Poisson process 

state 

independent 

iterative 

Marie’s Method 

state dependent 

state 

dependent 

iterative 


Table 2.1: Comparison of Approximate Methods for "Almost” PFQNs 


2.3.1 Near Decomposabiiity 

If the order of the transition probabilities HoPoi PiPio between Rq and Ri 
in Figure 2.2.a is much greater than the order of magnitude between the transitions 
of R 2 , then the QN formed by Rq and R x wiU obtain local equilibrium before any 
interaction with R 2 can occur [Cou75]. A system with this property is known as 
nearly decomposable. The network can (approximately) be analyzed the following 
way: Rq and R x are analyzed separately. We are interested in the flow xfruM of 
customers out of the subsystem, where ni is the number of customers in Rq and 
Rx. An aggregate model can be constructed which is shown in Figure 2.2.b. This 
approach was applied to GSPNs in the work of [BT86]. 

2.3.2 Flow Equivalent Aggregation (FEA) 

The basic approach is to replace a general server or subnetwork of queues by a 
flow equivalent service center (FESC) [Bra85, LZGS84]. An arriving customer sees 
the FESC as a black box whose behavior is completely characterized by a listing 
of the residence time (the inverse of the throughput) as a function of the possible 
customer population. In order to determine the state dependent service rates, the 
subsystem is studied offline, i.e., without any interaction with the environment. In 
general, a FESC can only match the first moment of the probability distribution, 
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(a) System (b) Aggregated System 


Figure 2.2: Nearly Decomposable System 

as the higher moments are too cumbersome to obtain. Three steps for constructing 
an FESC can be determined [CS78]: 

1. Determine a subsystem which can be analyzed by FESCs. 

2. Analyze the subsystem by maintaining the number of customers constant. 
This is done by shorting out (Norton’s theorem) all other parts of the network 
and varying the number of customers up to the maximum allowed in the 
subsystem. 

3. The aggregated network AN is constructed as a server with a queue-dependent 
service rate. Let Xsw{k) be the conditional throughput of the subnetwork 
(SVF) when k customers are present, and h A n{^) the (state dependent) service 
rate of AN. The approximation is done through the following equality, under 
the assumption of exponential service time: 

X S w[k) = fi A N{k) (2.4) 

Figure 2.3 illustrates the concept from an example of computer systems [LZGS84]. 

Here the set of queues surrounded by the dashed box would be analyzed in isolation 
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Figure 2.3: Concept of Flow Equivalent Server 

and substituted by a single queue with a state dependent service time. 

In FEA we assume that the behavior of the subnet is independent of the 
arrival process and depends only on the number of customers in the system, i.e., the 
behavior is completely independent of the environment. This condition is violated 
in several cases (e.g., if internal loops are present). Thus in general, the mean 
completion time (MCT) depends on the interarrival time of customers. FEA does 
not always lead to an approximate solution, as the following property shows. 

Property 2.3.1 (Flow Equivalent Aggregation [Van78]) For the particular 
case of product form QNs, flow equivalent aggregation leads to exact results. 

This approach does not involve any iterative process and is computationally 
efficient. As we do not iterate there exists no convergence problem. Courtois [Cou77] 
showed that the approximation is good when the number of state changes within the 
subsystem between interactions with the environment is reasonably large. Chandy 
et al. [CS78] observed that if the actual coefficient of variation of the service time is 
much larger than the coefficient of variation of the exponential distribution, signifi- 
cant error can be introduced. FEA techniques have been introduced in the context 
of SPNs [JD91a, JD91b] and are presented in Section 5.2. 

2.3.3 Marie’s Method 

Marie’s method provides an alternative way to derive the state dependent 
service rate of subsystems [Mar79] in an iterative fashion. While in FEA the number 
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of customers in the system is held constant, and in RTP the arrival rate is held 
constant, in Marie’s method the arrival and service rate is assumed to be state 
dependent and must be found in an iterative manner. While the network may 
possess general service times, blocking and fork/join operations, the QN must be 
decomposable in the sense that the partitioned subnetworks can be interpreted as 
servers in a PFQN (i.e., the QN is in "almost” product form). To this end, the QN 
is partitioned into M subsystems according to the following definition: 

Definition 2.3.1 (Product Form Decomposition) [BD90] A non product 
form QN is said to be decomposable into product form if: 

1. The transitions between subsystems are single transitions, e.g., bulk service or 
arrival between subnetworks is not allowed. 

2. The behavior of the subnetwork depends only on its internal state, e.g., the 
service rate cannot depend on the queue length of another subnetwork. 

3. The input/output behavior of a customer is independent of routing decisions 
taken during its stay in the subsystem, i.e., the visit ratio is unique. 

4 - All fork/join operations happen within the subsystem. 

After the partitioning, each subsystem is iteratively analyzed in isolation under 
a state dependent Poisson arrival process. As with many iterative methods, the 
uniqueness of the solution can not be proven although numerical experience has 
shown that a unique fixed point does indeed exist, although convergence sometimes 
presents a problem [BD90]. 

In [BD90], an introduction and comparison between Marie’s method and FEA 
is given. The authors conclude that the results obtained using Marie’s method axe 
usually closer to the actual solution than those obtained with FEA, especially when 
large coefficients of variation axe encountered. In terms of computational effort, 
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Marie’s method seems to be more efficient for large values of N , while FEA is more 
efficient for small values of N. 

A detailed discussion of the numerical aspects of Marie’s method and its ap- 
plication to GSPNs is presented in Section 5.3. 


2.3.4 Response Time Preservation 


In response time preservation (RTP) [ABS84], the QN is first decomposed 
according to the rules in definition 2.3.1. In the stochastic phase, we replicate the 
response time of a general server G with first come, first served (FCFS) discipline 
by an exponential server as shown in Figure 2.4. Here the assumption for the 
analysis phase is that the complement of the general server (shown in the dashed 
box in Figure 2.4.a) generates a Poisson arrival rate fi of customers. The isolated 
subsystem shown at the bottom of Figure 2.4.a is the classical M/G/l network of 
two queues: a single exponential server of rate [i and a complex server with general 
service time G. Let s be the service time, CV be the coefficient of variation and X be 
the throughput of the general server (which could also be formed by a subnetwork). 
By the PoUazcek-Khinchin formula [Kle75], the response time for an open M/G/l 


queue is given by: 


RTq = s + 


(l + CV*)s'U 


(2.5) 


2(1 — U) 

where U = Xs is the server utilization. During the aggregation phase RT a is 
replicated by the response time RT' of the isolated exponential server as shown in 
Figure 2.4.b. 

RTn(u) = RT'(u) (2-6) 


RT ' for an open M/M / 1 with service time s' is given by: 


Ctf PC V . ■"? 


RT' = 


l-Xs' 


(2.7) 
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Solving equation (2.5) and (2.7) for the unknown service time s' yields 

,, _ BTg 

1 + XRT g ( 2 ‘ 8 ) 

In the last step the exponential server is substituted back into the original system to 
yield the transformed model which is now in product form. Ideally we select p such 
that the M{p)lG/\ queue has the same response time as the system in Figure 2.4.d. 
As we do not know the throughput a priori , we also cannot find the correct Poisson 
arrival at the beginning. An iterative process is therefore concerned with finding p. 
Note that in RTP, the coefficient of variation of the service time is required. RTP 
yields good results for moderate coefficients of variations (CV < 5) of the customer 
interarrival time [ABS84]. 

Iterative methods are especially suitable when the interaction of the subnet 
with the environment is strong. The drawback is that convergence might eventually 
present a problem. 
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2.4 Approaches to Contain the Problem of State Explosion in General- 
ized Stochastic Petri Nets 

2.4.1 Bounds on the Throughput Vector 

Whenever possible it is desirable to base the analysis of a PN on algorithms 
which are of polynomial complexity on the net structure rather than on the reacha- 
bility graph (whose construction is of exponential complexity). J. Campos et al. use 
a linear program based on the incidence matrix to find upper and lower bounds for 
the throughput of MGs [CCCS92]. These bounds are tight, i.e., there exist proba- 
bility distributions which will reach the bounds. The results have been generalized 
to Petri nets with a unique repetitive firing count vector [CCS91a] and free choice 
nets [CCS91b, Cam91]. Some of the results are presented in Section 3.4.2. 

2.4.2 Efficient Construction of the Tangible State Space 

Tangible markings correspond to states where the system spends some amount 
of time, while vanishing states often correspond to logical states and represent some 
intermediate decision process. Typically during the construction of the reachability 
graph, quite a few number of vanishing markings are encountered [BCR87]. In the 
solution of GSPNs we are more constrained by memory than time (at least when 
using the software package SPNP [DBCT85]), so much could be gained by using a 
technique which avoids the construction of the entire state space, and would reduce 
the number of vanishing states which have to be considered when reducing the 

CTMC to the set of tangible markings. 

In [BCFR87] G. Balbo et al. examine the problem of the efficient construction 
of the tangible reachability graph. The reduction is done at a structural level, 
partitioning the GSPN into confusion free subnets. The reachability graph of each 
subnet is generated and the set of tangible states can be found. 
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2.4.3 Iterative Method for the Solution of Generalized Stochastic Petri 
Nets 

In [CT91b, CT91a] Gianfranco Ciardo et al., propose an iterative (approxi- 
mate) method for the solution of GSPN models. They identify several Petri net 
constructs which interact weakly. The solution of the GSPN is obtained by itera- 
tively solving the subnets. In addition to tight synchronization ( parbegin-parend ), 
the following constructs were identified: 

• Processor-Sharing: In this construct, two disconnected subnets share an 
implicit resource, i.e., a resource which is not explicitly represented by a place. 
Now the firing rates of two subnets are mutually dependent on each other 
depending on the amount of resources the other subnet has already used. 

• One-way condition testing: Figure 2.5 shows a GSPN with the one way 
condition testing. Transition ti is enabled only if place p\ is marked, but 
its firing does not remove a token from pi. As Subnet SN 2 can be solved in 
isolation, and the results are going to be exact. In order to solve for the subnet 
SN 2 , two methods have been proposed: 

1. Rarefaction: The argument here is that the absence of the marking of p 1 
can only slow down transition The equivalent firing rate for t\ can 
then be computed as: 

AJ , * W = A° W P (pj is marked} 

2. Compression: Here the idea is to compress the behavior of subnet SjV x 
into a small subnet SN 3 . Now SN 2 and SN 3 can be combined for the 
analysis of SN 2 . The analysis is going to be exact if the time in which 
Pi is marked is exponentially distributed. 
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SNi \ j SNi 



Figure 2.5: One Way Condition Testing in a Petri net 



• Producer and Consumer 

The construct is shown in Figure 2.6. This construct represents a synchro- 
nization between the two processes of net SNi and SN?. 

2.5 Approximate Aggregation in Generalized Stochastic Petri Nets 

G. Balbo et al. combined QNs with GSPNs for the computationally efficient 
analysis of complex systems [BBG86, BBG88]. They combine features from QNs 
and GSPNs for an efficient analysis. In [BBG86] they start from a GSPN model. 
They then identify subnets which allow for a product form solution, and find flow 
equivalents for these subnets. These subnets are then substituted into a high level 
GSPN. This allows for a reduction of the state space. 

The authors use a hybrid modeling approach. Parts of the system are modeled 
by QNs (either directly in QN notation or simply by state machines), while other 
parts are modeled by GSPNs. If the QN has a product form, then the flow equivalent 
server can be found analytically, by solving the PFQN. If the QN does not have a 
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product form, then the QN has to be solved (for example by means of a GSPN), 
and a flow equivalent server constructed. In this case the flow equivalent server has 
the form of a timed transition with a marking dependent firing rate. 

In both papers, Balbo et al. use QNs as the underlying model, and only use 
GSPNs whenever necessary, to represent parts which cannot be represented by QNs. 
In the proposed work, we want to use GSPNs as the underlying model and only use 
the techniques adapted from QNs for the analysis, rather than QNs themselves. 

2.6 Summary 

In this chapter we have briefly reviewed some of the relevant literature for 
QNs. Of special interest are FEA, RTP and Marie’s method as they are well known 
methods for the approximate analysis of QNs. Later in this work, we will use these 
techniques to devise efficient approximation techniques for MGs. 

An approximation method for MGs by Li et al. will be reviewed once the 
structural decomposition of MGs has been defined. 



CHAPTER 3 

Terminology and Basic Properties for Petri Nets 
3.1 Introduction 

In this section the basic definitions for Petri nets are introduced. It is assumed 
that the reader is familiar with the basic concepts of Petri nets. See [Pet81, Mur89, 
Sil85] for an introduction. 


3.2 Terminology 

Definition 3.2.1 (Petri net) A Petri net structure is a bipartite directed graph 
represented by a quadruple Af = {P,T, Pre, Post) where, 


P = {Pl,-,Pn} 

T — {tlj •••) i-m } 

Pre : P xT IN 

Post :T x P t-+ JV 


is a finite set of places, n = \P\ 

is a finite set of transitions m = |T| 

is an input function that defines the set of directed 

arcs from P to T 

is an output function that defines the set of directed 
arcs from T to P 


JV is the set of nonnegative integers. Pf]T — 0, PUT ^ 0. 

Here, Pre and Post axe linear incidence functions representing the input and 
output arcs respectively. 

Alternatively, a graph-based perspective of the net structure is given by Af 
(P, T, F, W) where F is a flow relation, FC{Px T)U(T x P), and W is a weighting 
function on F. If the net is ordinary (all weights are 1), W can be removed (i.e., an 
ordinary net Af can be defined as Af = ( P,T,F )). Here we consider only ordinary 

nets. 
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Definition 3.2.2 (Incidence Matrix) The incidence matrix of the net structure 
is the n x m matrix C, where C = Post - Pre. Post, Pre and C represent 
matrices, while Post{tf), Pre{t { ) and C(t,') represent the respective column vectors 
corresponding to transition t,, while l{p { ) represents the row vector of p { in C. 

Definition 3.2.3 (Marking) A marked Petri net ( ff ,Mq), is a net provided with 
an initial marking M 0 . M(p) refers to the number of tokens residing in p. 

In this thesis net refers to the structure, while net system refers to the structure 
plus some initial marking. 

Definition 3.2.4 (Firing of Transitions) Transition t is finable (enabled) at 
marking M, if M > Pre{t). If the firing oft leads to M' (i.e., M[t)M ') then 

M' = M + Post(t ) — Pre(t) = M + C(t) (3.1) 

Equation (3.1) is referred to as the linear state equation. Assuming <j = 
tnt i2 . . . t ir is a sequence of transitions finable from M 0 (i.e., M 0 [cr)M), then M = 
Mo + C ■ o, where a is the m x 1 firing count vector (i.e., the firing order is lost). 
If a is a vector, then o{j) is the scalar representing the number of firings oftj in a. 

The pre-set and post-set of a transition and place are defined as follows: 
Definition 3.2.5 (pre-set, post-set) 

*t » {p\Pre(p,t) > 1} = pre-set oft, i.e., the set of input places oft 
** = {p|-Post(t,p) > 1} = post-set oft, i.e., the set of output places oft 
*P = (f| Pre(p,t) > 1} = pre-set ofp, i.e., the set of input transitions of p 
P ~ ^ 1} = post-set ofp, i.e., the set of output transitions ofp 

Definition 3.2.6 (reachability set) M n is said to be reachable from the initial 
marking iff there exists a firing sequence <r that transforms M 0 into M n . The reach- 
ability set R(N,Mo) is the set of markings which are reachable from the initial mark- 
ing Mo. A marking M is potentially reachable iff 3X >03 M = M 0 + C ■ X > 0 
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Definition 3.2.7 (home state [Cam90]) A state M is a home state iff'iM' € 
R(N,M 0 ), M € R{N, M'). Home states form a closed (i.e., mutually reachable) 

set. 

Later we will see, that the presence of home states leads to ergodic systems. 

Definition 3.2.8 (Marking Bound) Let (Af,M 0 ) be a net system 

1 . The bound of a place p € P is 

B(p) = max{M (p) | M € R(Af ,M q)} (3-2) 

where M'{p ) refers to the number of tokens in place p. 

8. The structural bound of a given place p of Af is given by the following linear 

program: 

SB(p) *= maximize M(p) 

subject to M = Mq + C • 3 > 0 (LPPl) 

a > 0 

A net M is structurally bounded iffVMo the net is bounded. 

i.e., all net systems constructed from A f are bounded. 

Definition 3.2.9 (persistent place) A place is persistent iff\p*\ = 1 

Once a persistent place is marked, the routing of the token out of the place is 
deterministic. 

Definition 3.2.10 (Liveness) A transition t € T in (N,M 0 ) is live if 
VM € R(N, Mq) : 3 M* € R(N, M ) 3 M' > Pre{t) 


(N, M 0 ) is live iff all transitions are live. 
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(Marked places are 
indicated by letters) 


Figure 3.1: Ordinary Petri Net System and its Reachability Graph 

Definition 3.2.11 (Invariants [Cam90]) A T-invariant (or consistent compo- 
nent) is an m-vectorX :T>-+ JV such thatC-X = 0 and X ^ 0. A P -invariant (or 
conservative component) is a n-vector Y : P t-+ iV such that Y T • C = 0 and Y ^ 0. 
The support of a T-invariant (P -invariant) denoted by \\X\\ (\\y\\), is the set of 
transitions (places) such that \\X\\ = {f € T\X{t) > 0} (\\Y\\ = {p <= P\Y(p) > 0};. 
A (P orT) invariant I is: 

1. Minimal iff there exists no other invariant I ' such that ||/'|| C ||/[|. 

2. Canonical iff the greatest common divisor is 1. 

3. Elementary iff it is minimal and canonical. 

The nonzero solution to C • X = 0 ( Y T - C - 0) is also called the T-semiflow 
(P-semiflow). 

A firing count vector which leads back to the original marking is a T-invariant 
(but not necessarily vice versa). A net is consistent iff all transitions are covered 
by T-invariants, i.e., 3 a T-invariant X 3 X > 1 (1 is a vector of appropriate size 
whose entries are all one). In the support of a P- in variant, the weighted sum of 
tokens over a set of places is constant for any marking. 
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Figure 3.1 shows am example of a simple Petri net. For the given initial mark- 
ing, there are five possible states (all states are home states). The incidence matrix 

is given by: 


ti ^2 ^3 ^4 ^5 

1-1001 
-1 1 0-1 0 
01-101 
0-1 1-1 -1 
0 0 0 1 -1 


A 

B 

D 

E 

F 


(3.3) 


Solving for the invariants yields two minimal P-invariants: 


y x T = [1 i o o l], r 2 r = [o o l l l] 


Two minimal T-in variants exist: 

Xj = (1110 0] T , x 2 r = [0 1 2 1 1] T 

Consistency and boundedness do not guarantee other “nice” properties such as 
the existence of home states or even liveness. Figure 3.2 shows the example of 
a structurally live and structurally bounded Petri net system without home states 
and its reachability graph. Depending on whether U or f 2 are fired first, the resulting 
states are disjoint and form closed sets. The net has 3 minimal T-invariants. 

X 1 T = [lllllllllll T , Xj = [201120202 2] t , X\ = (00 1 1020202] T 

For the given initial marking, only Xf leads to a feasible firing sequence, i.e., 3a 3 
$ — XT. In general, the existence of home states depends on the initial marking! 

Figure 3.3 shows a consistent, structurally live and bounded Petri net, which 
has an interesting property: By increasing the initial marking (e.g., m 0 (p 5 ) = 1), 
the net system can deadlock. 
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A 



(a) Live and Bounded Petri Net System 



(b) and its Reachability Graph 

Figure 3.2: Live and Structurally Bounded Petri Net System Without Home States 


Figure 3.3: Structurally Live, Consistent and Bounded Petri Net System 


3.3 Subnets 

Definition 3.3.1 (Single Cut) A single cut £ traced through a set of interface 
places Q splits a net jV = ( P,T,F ) into two subnets SNi,= ( Pi,Ti,F { ) i = 1,2, 
where 

• t x u r 2 = r, t x n t 2 = 0 

• Px U P 2 = P, Px n P 2 = Q { the interface} 

• Fi = F D ({Pi x Ti ) U (Ti x Pi)) 

m Q — {T* U* Tx} n {T 2 * U* T 2 } defining the interface between SNx and SN 2 . 

In other words, the subnet SNi is characterized by T, because P t =* T, U T*. The 
complement of SNx is SN 2 and vice versa. Figure 3.4 shows a single cut through a 
MG system and the corresponding subnets. The interface places are given by places 
A and M. 

Definition 3.3.2 (Projection) Let jV be a Petri net and SN { be a subnet of fif . 
Let Tl be a marking vector of fit. is the projection ofJI on SNi such that: 


Vp € Pi, M*(p) = M(p) 


(3.4) 
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Cut f 

I ^ 

/ 

10 
L 

(a) Sample Petri Net N 

A A 


(b) Subnet SNi (c) Subnet S/V 2 
Figure 3.4: Example for Single Cut Generating two Subnets 

3*4 Petri Net Subclasses and Their Properties 

In this section we introduce three well known subclasses of Petri nets (state 
machines, marked graphs and free choice nets). 

Let us first introduce a well known property from graph theory. 

Definition 3.4.1 (strongly connected [Mar71]) A directed graph is said to be 
strongly connected iff a directed path exists between every pair of vertices. 

Therefore, a Petri net is strongly connected iff a directed path exists between any 
pair of places and transitions. 
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3.4.1 State Machines 

Definition 3.4.2 (State Machine) A state machine (SM) is a Petri net such that 

WteT r*| = 1 and |f| = 1 (3.5) 

The following properties for state machines hold [SV90]: 

Theorem 3.4.1 (State Machine) 

i) SMs are structurally bounded and Y = TL. 

ii) A SM is live iff jV is strongly connected and E?=, M 0 (i) > 1. 

From a structural point of view, closed monoclass PFQNs (Gordon- Newell QNs) 
can be represented by strongly connected SMs. SMs allow choices but cannot model 
synchronization. 

3.4.2 Marked Graphs 

Definition 3.4.3 (Marked Graphs) A marked graph (MG) is an ordinary net 
such that any place has one input and one output transition. 

Vp € P |*p| = land |p # I = 1 (3.6) 

MGs allow the modeling of concurrency and synchronization but not of decisions. 
From the perspective of Petri nets, the structure of monoclass Fork/Join Queue- 
ing Networks (QNs) with Buffers (FJQN/B) [DLT90] is represented by strongly 
connected MGs [DLT90]. 

For MGs, the properties stated in the following theorems can be established 
using polynomial time algorithms: 

Theorem 3.4.2 [Mur89] Let jV , be a MG: 

i) (.V, M 0 ) is live iff all circuits (P -invariants) are marked. 
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ii) Af is structurally bounded (i.e., bounded for any M 0 ) iff it is strongly connected. 
Corollary 3.4.1 MGs are structurally live 

Corollary 3.4.2 [Cam90] The liveness of a MG can be checked by the absence of 
unmarked P-invariants. 

JBY^03Y t -C = 0 ,Y t -M o = 0, (3.7) 

where Y ^0 means Y > 0 and Y ^ 0. 

Theorem 3.4.3 [Mur77] Let (Af, Mo) be a live MG system. The three following 
statements are equivalent: 

i) M is reachable from Mq (i.e., Her such that Mq[o)M). 

ii) M is an integer solution of M = Mq + C • <t > 0, <r > 0. 

Hi) M is an integer solution of Y 7 ■ M = Y 7 • Mq for any P -invariant Y (i.e., 

by > o, y 7 • c = o;. 

In MGs, all states (in particular Mo) are home states. 

’ \ t \ * 

Corollary 3.4.3 Let (Af,Mo) be a live MG system, then Vp € P : SB(p ) = B(p). 
i.e., the structural and marking bound are the same. 

Corollary 3.4.4 In MGs, the marking bound is given by the minimum number of 
tokens in a directed circuit containing p. 

Starting from a place p, a directed circuit in a Petri net is obtained by traversing 
along the arcs and eventually returning to p. In MGs, the places in a directed circuit 
are P-invariants. 

Stochastic MGs are obtained by associating time to transitions, i.e., a tran- 
sition fires after it has been enabled for some time. Tight insensitive (i.e., for any 
probability distribution) cycle time lower bounds axe known for stochastic MGs 
[CCCS89]. 
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Theorem 3.4.4 [CCCS92] The minimum mean interfiring time (i.e., cycle time) 
for a live and strongly connected stochastic MG system can be obtained by solving 
the following linear programming problem: 


r mm = maximize Y T • Pre • s w.r.t. Y 
subject to Y t • C = 0 
Y T • M 0 = 1 


Y >0 


(LPP2) 


where s is the vector of the mean service times of the transitions. Moreover, there 
exist probability density functions for which the bound can be reached (tightness). In 
the particular case of deterministic, timing the bound is reached. 

According to the previous result, a lower bound for the mean interfiring time of 
a live and bounded MG system can be computed in polynomial time solving the linear 
programming problem (LPP2). From the theory of LPP [NRKT89], it is well known, 
that optimal solutions appear at the extremal feasible solutions. In (LPP2), feasible 
solutions are P-invariants (i.e., Y > 0, Y 7 • C = 0), and the extremal solutions 
«ire the minimal P-invariants. Moreover, minimal P-invariants define elementary 
directed circuits of the net (Theorem 3.4.3), while the constraint Y T ■ M 0 = 1 is 
just a normalization. Therefore, in order to deal with the interpretation of the 
computation in (LPP2) it is enough to observe that Y^ • Pre • s/Y ^ • Mo is nothing 
more but the sum of service times associated with the transitions belonging to the 
elementary circuit defined by the minimal P-invariant Y , divided by the number of 
tokens in that circuit. This value is just the mean interfiring time of any transition 
of the circuit if it is assumed to work in isolation. Therefore, (LPP2) computes 
the mean interfiring time of the slowest circuit considered in isolation. In other 
words, (LPP2) only gives a lower bound, because it disregards the interaction with 
other circuits (i.e., it disregards the waiting times due to the synchronization among 
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circuits). Later in this work we will use the theory of bounds to find suitable initial 
conditions for Response Time Approximation (RTA). 

As a final remark, r ,nm (in LPP2) is finite iff there exist no unmarked P- 
semiflows (i.e., directed circuits). If r mm is unbounded, then a deadlock (thus 
non liveness) is detected. Because the bound given by (LPP2) is tight, liveness, 
deadlock-freeness and the absence of unmarked circuits for bounded MG systems 
are equivalent conditions. 

Theorem 3.4.2 states that liveness and boundedness cam be established on the 
net structure and initial marking. Theorem 3.4.3 states that every integer solution 
(e.g., obtained by solving a linear system of equations) is indeed a reachable state, 
while LPP1 (Definition 3.2.8) allows the computation of the marking bound in 
polynomial time [Kar84]. 

State Machines and Marked Graphs are dual net subclasses (i.e., by exchanging 
places with transitions, the structure of MGs is obtained from the structure of SMs 
and vice versa). The reverse of a net is obtained by reversing the flow relation (i.e., 
the arcs). The reverse of a MG (SM) is a MG (SM), in other words, MGs and SMs 
are self-reverse net subclasses. Therefore the reverse-dual of a MG (SM) is a SM 
(MG). 

3.4.3 Free Choice nets 

Free choice nets (FC-nets) are a well known subclass of Petri nets which can 
model sequence, choice and concurrency. In FC-nets, choices and synchronization 
do not directly interfere with each other, i.e., they cannot coincide on the same 
transition. More formally, a FC-net is an ordinary net such that if two transitions 
have a common input place, this is the only input place of both transitions. 
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Definition 3.4.4 (free choice nets [Hac72]) Free choice nets are ordinary Petri 
nets such that 

Vp € P : \p*\ > 1 => V) = M 

The following theorem can be used to determine whether a given structurally 
bounded FC-net is structurally live. 

Theorem 3.4.5 [CCS91b] Let C be the incidence matrix of the strongly connected 
structurally bounded FC-net Af . Then the net Af is structurally live iff rank(C ) = 
m — 1 — (a — n), where m = \T\, n = |P| and a is the total number of input arcs to 
the transitions. 

Theorem 3.4.6 [ES90b] Let (Af,M 0 ) be a live and bounded FC net, then 

WpeP: B{p) = SB(p) 

Similarly to MGs, we can find sufficient conditions for the liveness and boundedness 
of FC nets based on the net structure with polynomial time algorithms. 

3.5 Stochastic Petri Nets 

In order to use Petri nets for the performance analysis of systems, the notion 
of time has to be introduced. There axe potentially two ways to incorporate time 
into a Petri net: 

i) By associating time to places, i.e., places have to be marked for some time 
before the tokens can be removed [Ram74, WDF85], 

ii) or by associating time to transitions. In the literature, two different firing 
rules have been defined: 

(a) In two stage firing, the transition absorbs the tokens and releases them 
after some time [Zub85], 
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(b) In the one stage firing rule, transitions fire after being enabled for some 
time, i.e., the firing is atomic and identical to the usual firing rule [Mol82, 
RP84]. 

By associating time to transitions, Petri net models directly represent synchronized 
QNs [Cam90], where places represent queues and transitions represent service sta- 
tions or fork/join operations. 

In any of the two models, the conflict resolution has to be addressed. In the two 
stage firing of transitions, the resolution of conflict is predetermined either determin- 
istically or stochastically by associating a routing probability to the transitions in 
conflict. In the atomic firing , conflict is usually resolved by race policy, i.e., the tran- 
sition which samples the minimum service time fires first. If other conflict resolutions 
are used (e.g., age memory) the models become increasingly complex [AMBC + 89]. 
Stochastic Petri nets (SPNs) with atomic firing can model pre-emption, while this 
is not possible for SPNs with the two stage firing rule. Due to their larger modeling 
power, in the present work we adopt SPNs with time associated to transitions and 
atomic firing rule. 

We assume that the delay associated with transition is a nonnegative con- 
tinuous random variable X{ with the exponential distribution function. 

F x ,(x) = Prpf,- < x] = 1 - exp(— A, x) (3.8) 

i.e., the probability density function (pdf) is fx, = K exp(— A,x). 

The average delay is given by: 

Si = / [1 - Fx t (s)]dx = f exp(-A,x)dx = (3.9) 

Jo Jo 

A i is called the firing rate of i,-. 

The exponential distribution is the only continuous time probability distribu- 
tion which has the memoryless (Markovian) property, 


P{X > x + a\X ><*} = P{X > x} 


(3.10) 
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It is possible to show [Mol82], that the reachability graph of a bounded SPN 
with exponential firing distribution is isomorphic to a CTMC. 

Activities which do not require any amount of time are often modeled by logical 
states in the sequence of events and can be modeled by using immediate transitions. 
In the graphical representation, immediate transitions are drawn as thin black bars, 
while timed transitions are drawn as thick white bars. The probability that the 
immediate transition t k from the enabled choice set S(M ) fires is: 

= r rt - (»•“) 
where r< € JV + is the probability rate of i,-. Subsequently we have two types of 
markings: 



• A Vanishing marking is a marking where an immediate transition is enabled. 
The GSPN leaves that state immediately. If there are no trapping loops con- 
sisting only of vanishing markings, the probability of finding the GSPN in that 
marking is zero. Vanishing markings occur frequently in the execution of the 
GSPN, and are often associated with logical states [BCR87]. For example, 
after a task is completed, a vision system will not remain idle if another task 
is waiting (it will leave the idle state immediately and become busy). In the 
construction of the CTMC the vanishing markings have to be eliminated. 


• A Tangible marking is a marking where only timed transitions are enabled. 
The GSPN has a nonzero probability of being in that state. These states are 
associated with states where some sort of non-zero length activity takes place. 
A typical example would be the downloading of an image into the memory of 
the computer. 

An important performance measure is the visit ratio of transitions. It can be 
defined as the normalized (with respect to a transition) throughput. 
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Definition 3.5.1 (visit ratio) Let X be the vector of throughputs in steady state 
and Xj its j-th entry (i.e., the throughput oftj) of the net system (J\f, M 0 ) . The 
vector of visii ratios v 3) (normalized for tj) is given by 


M) 1 v 

v “5* 


(3.12) 


If the conflicts in FC nets are modeled by immediate transitions, the visit ratios can 
be computed based on the net structure [Cam90]. It is obvious that v 3) must be a 
T-invariant, i.e., 


C-v j) = 0 (3.13) 

Let tn, ti 2 be two transitions in structural conflict (i.e., with the same pre-set) 
with probability rates r , and r, 2 respectively and X { the flow into * t ,- x . Then the 
throughput of fii and f t2 is given by: 


Xu 

X x2 


r l + r 2 

r» 

ri + r 2 


Xi 

Xi 


Dividing eq. (3.14) by eq. (3.15) and rearranging terms yields 


(3.14) 

(3.15) 


„(i) _ „(i) 

r *l v «2 — r i2 u ,l 


(3.16) 


By considering all conflicts in the net, eq. (3.16) can be combined together with eq. 
(3.13) and can be rewritten in matrix form a s 

j • = 0 vf = 1 (3.17) 

Eq. (3.17) has a unique solution iff the number of the independent rows of the matrix 
is m — 1 with m = |T|. J. Campos [Cam90] showed that the number of independent 
rows in the matrix in eq. (3.17) for subclasses of Petri nets which include FC nets, 
is indeed m — 1. 
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Figure 3.5i Modeling of Timed Structured Conflict witli Immediate Transitions 
(Firing and Probability Rates are in Parentheses) 


Consider the structural conflict modeled with timed transitions in Figure 3.5. a. 
Since the minimum of two exponential distributed variables X x and X 2 with rate 
and A 2 , min(X 1 ,X 2 ) is again an exponential distributed random variable with rate 
Amin(Xi Xi) — Ax + A 2 , the sojourn time of a token in p,- is a random variable with 
negative exponential pdf with mean 

* = ( 3 - 18 ) 

2>j=l A ij 

i.e., the rate out of p, is i^j [Mar89]. The probability that t ik samples the 
min imum time (i.e., that it will fire) is 

Prfa) = (3 «) 

Based on the previous discussion we can transform Figure 3.5 .a into an equivalent 
conflict with immediate transitions as shown in Figure 3.5.b. The steady state 
probability of place p" on being marked is zero, while the immediate transitions t*™ 
model the conflict. 


Theorem 3.5.1 LetjV be a stochastic FC net and pi be the input place of a struc- 
tural conflict modeled with exponentially timed transitions. The transformation pre- 
sented in Figure 3.5 preserves the underlying CTMC. 

Proof: It is a direct consequence of the construction rules for the underlying CTMC 
from the reachability set that includes vanishing markings. □ 
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In our model we do not allow priorities to be associated with immediate tran- 
sitions or inhibitor arcs. An alternate definition of the generalized stochastic Petri 
net (GSPN) defined in [Mar89] is therefore given by: 

Definition 3.5.2 (GSPN) The GSPN is a six-tuple 

GN = ( P , T, Pre, Post , Mo, R ) (3.20) 

(P, T, Pre, Post , Mo) is the underlying Petri net system. 

R — ( r i, r 2 . . . r m ) is the array where r± 

— is the parameter of the exponential probability density function of the 
firing delay if t l is a timed transition, 

— is a weight used for the computation of firing probabilities of immediate 
transitions if t, is an immediate transition. 

From a performance analysis point of view, we want to estimate the average 
behavior of the system in the long run. The usual assumption is that the system is 
ergodic [Ros83], i.e., in the long run the average of the observed values tend (almost 
surely) to the theoretically expected ones. In finite ergodic systems, if starting in 
state i, the expected time to return to state i is finite. 

Definition 3.5.3 [Cam90] A stochastic process Z T (r > 0 representing the time), 
is said to be ergodic iff the following condition holds: 

lim — r Z u du ~ lim E[Z r 1 < oc (3.21) 

T— *oo 7* Jq t — oc w J v y 

Not all live and bounded stochastic Petri nets are ergodic. Consider the net in 
Figure 3.2, here two closed subsets of states exist. The steady state marking will 
depend on the initial resolution of the conflict between ti and f 2 . We can circumvent 
this difficulty with the following important results: 


ORIGINAL PACS 
OF POOR QUALIFY 
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Theorem 3.5.2 [Cam90] If a Markovian bounded marked net has a home state, 
then its marking process is ergodic. 

Starting from the reachability graph and its stochastic interpretation, we can find 
the underlying homogeneous irreducible CTMC. 

3.6 Conclusions 

The practical modeling power of Petri nets is constrained by subclasses, but 
they allow more powerful analysis techniques using fast (e.g., polynomial time) 
algorithms that avoid the state explosion problem. These algorithms are based on 
establishing a relationship between the behavior of the net system and its structure, 
and are parameterized by the initial marking, which plays an auxiliary role. 

In the present work we limit ourselves to the analysis of subclasses of net 
systems whose marking process is ergodic (Definition 3.5.3) and where all states 
are home states. Thus, we only deal with systems whose underlying CTMC is 
irreducible (Theorem 3.5.2). Furthermore, we restrict our attention to subclasses 
of net systems where we can determine ergodicity of the marking process based 
on the net structure ff and an initial marking M 0 , considering only net subclasses 
where the reachability of a marking can be determined based on the state equation 
(Theorem 3.4.3) and thus all states are home states. From the above discussion, we 
now limit our attention to two net subclasses: SMs and MGs. Strongly connected 
SMs can trivially be solved because they have the structure of monoclass Gordon- 
Newell QNs and well known product form solutions can be applied. For MGs, no 
product form solution is known, therefore in the following chapter we concentrate 
our efforts on this particular net subclass. In Chapter 7, we consider MPMT-nets 
where we show that the above properties also hold. 
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CHAPTER 4 

Structural Decomposition and Response Time Approximation for 

Marked Graphs 


4.1 Introduction 

The basic approach in the presented approximation methods is to divide and 
conquer. We can identify two phases in the process: a structural decomposition 
phase (divide) and a stochastic aggregation phase (conquer). The structural decom- 
position phase consists of two steps: 


1. Structural: the system is partitioned into subsystems yielding the aggregated 
nets AAf. 

2. Marking: the marking of the aggregated nets is computed such that a notion 
of environment is kept (yielding the aggregated net systems AS). 

Based on the results of the decomposition phase, in the second phase the 
aggregated net systems subsystems are analyzed some assumptions which depend 
on the method we are using (e.g., arrival process, number of customers). 

In the following we are concerned with finding suitable assumptions (arrival 
process) for the analysis phase, while at the same time preserving some information 
about the environment of the subsystem. For MGs with deterministic timing or 
Jackson QNs, the aggregation preserves the performance measures, but in general 
the representation of the environment presents a dilemma. If we represent the com- 
plement of the subsystem better, the state space of the underlying CTMC is going to 
be larger. In the proposed approach, we try to preserve the response time (RT) of 
the subsystems exactly. As we will see, this is only possible for certain special cases, 
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while in general we only approximate the RT. Therefore, the proposed method can 

be seen as an RT approximation (RTA) [JSS92b]. 

In this chapter, we present mostly the theoretical aspects of the decomposi- 
tion. Several examples are introduced and analyzed in Chapter 6, after all of the 
aggregation techniques have been presented (Chapter 6). 

4.2 Single Cut, Aggregated nets, Basic Skeleton and Response Time 

Approximation 

The basic idea is illustrated in Figure 4.1.a: the net is split into two subnets 
by a single cut f defined through some places called interface places ( A and M m 
Figure 4.1.a) [JSS92b, JSS92a]. From the cut we define three nets: two aggregated 
nets AjV (one per subnet, Figures 4.1.b, AjV u and 4.1.C, AJV 2 ), and the basic 
skeleton (Figure 4.1.d). AAf i is obtained by preserving subnet SN t and connecting 
the interface places through a transition Tj representing the reduction of the other 
subnet SN,,j ± i. Transitions n (Figure 4.1.c) and r 2 (Figure 4.1.b) represent the 
aggregation of subnets SN X and S1V 2 , respectively. The basic skeleton (i.e., the fully 
aggregated net, Figure 4.1.d) is obtained by maintaining the interface places, while 
reducing each subnet to a single transition. For a single cut, the basic skeleton always 
has the same structure, i.e., it is independent of the topology of the stochastic MG 
under analysis. The markings of AJVi,AjV 2 (leading to the aggregated net system 
ASi = ( AJVi , Ml)) and the basic skeleton are computed based on the initial marking 
of the original net. 

Assuming single server semantics for the aggregated transitions r representing 
the subnets, the basic skeleton will be interpreted as a classical M/M/l/K queue, 
where I< represents the number of customers in the system (and therefore also the 
maximum queue size). Therefore, the throughput X of the basic skeleton can be 
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SNi Cut f SN 2 



(a) 

f f 



(d) Basic Skeleton System 


Figure 4.1: Aggregated Nets (b+c) and Basic Skeleton (d) Systems Obtained from 
the Indicated Single Cut to the Marked Graph (a) 
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written as [GH85]: 

* = <r 75) ^ 

where p = X/p is the ratio between the rates A and p of the aggregated transitions. 
With the above presentation we devise a divide and conquer approach to compute an 
approximation of the throughput: it should be obtained through an iterative scheme 
using both aggregated net systems and the basic skeleton system. Assuming an initial 
guess for one of the aggregated services (e.g., transition r 2 ), the throughput of *4<Si 
(Figure 4.1.b) is computed (solving the underlying CTMC) and a value for the 
service T\ is determined such that the basic skeleton has the same throughput (using 
equation (4.1)). In this way we approximate the RT of SNi. The value of the service 
time of Tj, is then substituted into the other aggregated net system (Figure 4.1. c) 
and the throughput is computed (solving the underlying CTMC). A new value for 
the service time of r 2 is determined such that the throughput of the basic skeleton 
and AS 2 are equal. The schema is iterated until a reasonable convergence of the 
throughputs is obtained. 

The goal of the above presentation is just to convey the basic idea. The 
next sections formalize the technical aspects of single cuts. First in Section 4.3 
the fundamental single input/single output (SISO) cut is characterized. The other 
possible single cuts are represented in Section 4.4 (SIMO/MISO cut) and Section 4.5 
(MIMO cut). 

4.3 Single Input/Single Output (SISO) Cut 

This section is devoted to the characterization and analysis of the case where 
a net is split into two subnets in such a way that the interface is defined by a subset 
of input or output places of a unique transition and by a subset of input or output 
places of another (eventually the same) transition. 
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Case 1 



Case 2 

Figure 4.2: Canonical Transformations Presented on the Interface 7 tJ (from SN { to 
SNy. tjj is an Immediate Transition) 

4.3.1 SISO and Canonical SISO cuts 

The idea of the single cut was formalized in Definition 3.3.1. As all places in 
MGs are persistent, it is possible to further characterize the interface: 

Definition 4.3.1 (Interface) Let f be a single cut traced through a set of interface 
'places Q ( Definition 3.3.1). Q = Q 12 U Q 21 where Qij is the subset of places defining 
the interface from SN X to SN jt i.e., •<?,, C T, and <?* Cl), t is a source (sink) 
transition of SNi iff t 6 T, and t € *Q tJ ( t € Q* t ). 

Eventually source and sink transitions will be the same. 

Definition 4.3.2 Let £ be a single cut producing subnets SN X and SN 2 , connected 
through the subset of interface places Q 12 , Q 21 C P. Furthermore, let Ijj be the 
condition that the interface from SNi to SNj has to satisfy. is SISO if: 

hi • 3fi2 € T\: Qi 2 C *J 2 or 3*' 12 € T 2 :Q X2 C 1t) 2 
hi • 3*21 € T 2 :Q 2 i C * 21 or 3* 21 € T\.Q 2 \ C ^t 21 
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Both subconditions in 7 ti are illustrated in Figure 4.2. Moreover Figure 4.2 
illustrates a canonical transformation to deal with the case where we have multiple 
places interfacing to the same pre-set (post-set) of a transition (i.e., when | Q i2 |> 1 
or | Q 2 i |> 1)- In the transformed net each interface has only a single place. We 
introduce an immediate transition t{j and replace the interface Qij by a single place 

The canonical transformation preserves the underlying CTMC (only vanishing 
markings are introduced). Conceptually we could transform any SISO cut into a 
canonical SISO cut (i.e., through two different interface places, 512 and q 2 1 which 
connect SN% to SN 2 and vice versa), but the canonical transformation increases the 
vanishing state space. Therefore, we would not introduce the immediate transition 
Uj in Figure 4.2 for the generation of the CTMC. In order to simplify notation and 
statements, in the rest of this section we assume without loss of generality, that 
the cut is canonical (i.e., done through a single input place and a single output 
place) or that the canonical transformation is done whenever necessary (even if only 
conceptually, because of the increased vanishing state space). Therefore, in the rest 
of this section | Qi 2 |=| Q 2 i |= 1, Qi 2 = {912} and Q21 — {?2i}- 

Property 4.3.1 (SISO cut) 

i) It is possible to check whether a cut is SISO is in linear time. 

ii) The canonical transformation preserves the CTMC associated to the original 

system. 

4.3.2 Qualitative Aspects 

The process of obtaining aggregated net systems .45 (Figure 4.1), can be 
viewed as a reduction of the net system [BerSo, Sil85|. Ideally speaking, we look for 
a reduction rule preserving the performance indices, in particular the throughput. 


Rules preserving the throughput will be very limited. If the throughput cannot be 
preserved, what property should be reasonable to preserve in such a way that our 
iterative scheme yields a “good” approximation for the throughput? We decide to 
preserve the maximum number of tokens in the non-aggregated places (i.e., its mark- 
ing bound). This corresponds to the idea of preserving the number of customers of 
the original system to its abstract views: the aggregated nets and the basic skeleton 
system. The preservation of the number of customers will imply the preservation of 
liveness for MG systems (remember that non-liveness is equal to null throughput!). 

The work will proceed in two steps: First the qualitative aspects will be stated 
in this section using known facts from the structural theory of MG systems, and 
later the quantitative aspects will be addressed (Section 4.3.3). 

Once a SISO cut has been decided on by the analyst, the transformation of 
the original net system into an aggregated net system can be viewed as a reduction 
process in which one subnet subsystem is aggregated into a single transition pre- 
serving the interface places and the other subnet subsystem. The marking of the 
remaining places has to be computed. 

Let JV = (P, T, F) be an MG and a canonical SISO cut through places q l2 
and q 21 defining two subnets SiV,- = (P,, 7),/)) with i = 1,2, where Q = {?i 2 ,? 2 i}- 

The original net system (JV , M 0 ) leads to two aggregated net systems, 
ASi = (AV i, Mg) i = 1,2, and the basic skeleton system. Structurally speak- 
ing from JV , SNj is reduced to r ; in AV, ( here AV refers to the structure, while 
AS refers to A\l plus some initial marking). The decomposition process is formal- 
ized by the following rule. 

Rule 1: SISO Decomposition Rule 

1. Structure: AV , = (P„ P,), i * 1,2 is an aggregated net, where: 
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• fi = T, U {Tj} 

• Fi = fi U ( 9y x Tj) U (Tj x qji); (=> ?y = = rj) 

The basic skeleton is just the cycle {gn —* t 2 — ► 921 — ► r i — *■ 912 } 

2. Marking: 

Let ~M be an optimal solution to the following linear programming problem: 

maximize M{qn) 

subject to M = M 0 + C • cr >0 (LPP3) 

a > 0, ~M > 0 

where C is the incidence matrix of the canonical MG obtained from the original 
net and cut. The initial markings for the two aggregated nets and basic 
skeleton are the projection of M: 

• for A/Vi : Ml(p) = IS 7(p), Vp € Pi 

• for the basic skeleton: M^qu) = 0 and K = A/jJ(g 2 i) = M(q 21 ) 

Property 4.3.2 (SISO Reduction Rule) Let (jV,M 0 ) be a live MG. 

1. For any reduction (from the original net system to an aggregated net system 
or to the basic skeleton system ) the bounds of the places are preserved: B{p) = 

B(p), Vp € Pi 

2. Liveness in both aggregated net systems and the basic skeleton system is pre- 
served. 

Proof: It is a direct consequence of the reachability and liveness theorems for 
strongly connected MGs (see Section 3.4.2), The transformation of the marking 
empties the least-loaded path in the subnet which is being reduced, therefore we 
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M{t { ) K customers 


HD - 1 

M(G) 

(c) Reduction to M /M(G)/l/K 

Figure 4.3: Abstraction of AS and Throughput Preserving Reduction 


maintain the marking bound of places. Thus, the liveness of the system is pre- 
served. ■ 

Moreover, it is not difficult to realize that for any optimal solution of (LPP3) 
•^( 912 ) = 0. This is true because q\ 2 and 521 belong to the same set of cycles (P- 
semiflows). Thus, the maximization of Jt(pij) implies the minimization of ~M(p Jt ). 

Because LPPs are of polynomial time complexity, polynomial time algorithms 
lead to the aggregated net systems and to the basic skeleton systems. 


4.3.3 Quantitative Aspects 

During the RTA aggregation, we replace a subnet SiV,- by an equivalent station 
(transition r.) with single server semantics. The response time of r t in isolation 
under an assumed arrival process approximates the response time of the replaced 
subsystem (i.e., subnet plus marking). In the aggregated net system AS Jf the 
aggregation of subnet SiV, (represented by r.) is the generator of the arrival of 
tokens to the subnet SNj, i ^ j. The service time associated with the transitions 
T ij s ( T i) = 1 /Mii * — 1,2 represents an approximation of the mean completion time 
(MCT) of SiV, in the original MG system. 

An interpretation of the system ASj (Figure 4.3.a) is the classical M/G/l/K 
queue, shown in Figure 4.3. b. Here we know the mean service time of the exponential 
server (transition tv), the number of customers in the system I\ = M (qu) + M (q 2 i) = 
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M{q 21 ), and the throughput X of the system. Then our goal is to find an equiva- 
lent server for G (where G is a general distribution function which summarizes the 
behavior of the subsystem {SNj, Mq)). 

One possible solution is to approximate M/G/l/K by an M/M(G)/l/K queue 
(Figure 4.3.c), preserving the throughputs the original system (here M(G) expresses 
the fact, that the general server G is approximated by the exponential server M(G)). 
Then the problem at hand is the following: with the information from the analysis 
of the M/G/l/K queue (population I<, interarrival rate A and throughput X), we 
have to find the service rate p of M(G) such that the throughput of M/G/ 1/ K and 
M/M(G)/l/K is the same. Equation (4.1) can be rewritten as: 


X = A(1 - 



(4.2) 


It is easy to see that X < A, because the throughput cannot be higher than 
the fastest firing rate of a transition. Rewriting equation (4.2) as polynomial and 
defining 0 = X/X, 0 < 1, we have to find the positive solution of the following 

polynomial: 

V hr i-/» 

The polynomial f(-) has a unique positive real solution because: 


(4.3) 


m 

d/(i) 

<*(;) 


0 

1-/3 


< 0 


A' i . i 

V'i(-)*- 1 >0, V- >0 

h V p 


(4.4) 


(4.5) 


i.e., the function is strictly increasing in l/p. The unknown firing rate is obtained 


by setting p = A/p. 

With this mapping of a M/G/l/K to a M/M{G)/l/K queue, we maintain the 
response time of the system when K = 1 because the steady-state probabilities of an 
M/G/c/c and an M/M/c/c queue are the same [GH85]. Therefore, the throughput 
and the response time of both queues are the same if A = 1. W hen A > 1, we onK 
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approximate the RT. Note that in the original work of RTP, the isolated subsystem 
wets analyzed under an open arrival process, while here we analyze the subsystem 
as a closed system. 

We can define the following functions: 

function solve J>asic.skeleton( A, X ) 

{Solves basic skeleton with unknown rate /x} 
input: A: known arrival rate 

X: throughput 
returns: /x 

function solve-aggregatecLsysternfa, k) 

{Solves the underlying CTMC of ASk} 

input: k: indicates system to be solved 

/x: firing rate of aggregated transition r 
returns: X \ 

function tesLconvergence(X new , X 0 u) 

{checks whether a convergence criterion has been achieved} 
input: X new \ most recently computed throughput 

X 0 id: previously computed throughput 
returns: TRUE if convergence has been achieved 

FALSE otherwise 

Algorithm 1: Quantitative analysis of SISO cuts 

1) Select a cut f 

2) Generate the aggregated net systems AS i and AS 2 and the basic skeleton 
system using the Decomposition Rule. 
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3) Initial Value computation: Determine an initial value of the firing rate of r x , 
fjQ (belonging to AS 2 ). A reasonable initial value can be obtained as follows: 

Let r 2 be an immediate transition (i.e., *t 2 = oo) in -4S X . 

A •■= l/Tu 

where T x is given by (LPP2) (Theorem 3.4.4). 

X° := solve. aggreg at ed^ystem(}i°, 2); 

4) r := 0; {iteration steps} 

5) repeat 

r := r + 1; 

^ 2 _1 := solve J}asic^keleton((i\~ x ,X^~ l ); 

X[ := solve-aggregatecLsysteTn(n 2 ~ l ? k); 

convergences test.convergence(X [ , l ); 

rf convergence = FALSE then 

:= solve J>asicskeleton(n T ^ 1 -,Xl); 

X{ := solve-aggregatedsystem(fi[, 2); 
convergence := test. convergence (X { , ^ x r ); 

end if 

until convergence = TRUE 

Algorithm 1 implements an iterative method. Thus, the existence and unique- 
ness of the solution, and the convergence of the method should be addressed. Ex- 
tensive tests have shown that for all cases considered, there exists one and only one 
solution, which is typically computed between 2 and 6 steps, until the difference be- 
tween the two last estimations of the throughput is less than 0.1% (the convergence 
criterion is computed in the function test.convergence). 

Although experimental convergence does not seem to depend on the initial 
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values for the iteration, “good initial values” can reduce the number of iterations, 
thus the practical complexity. The throughput of deterministic timings is frequently 
relatively close to the case of exponential timings, and its computation is polynomial 
(using Theorem 3.4.4). Therefore, the choice of the initial value of r x is justified. 
Because of shorting out and the assumption of deterministic time, the initial value 
is an optimistic initial value. From our experience, the cycle time of the subnet in 
short circuit is in many practical cases within 20% of the actual value. 

4.4 Single Input/Multiple Output (SIMO) Cut 

This section considers a class of cuts (disjoint with respect to the SISO class) 
where the places of one and only one of the two interfaces does not belong both to 
the pre-set or the post-set of a single transition. 

The output interface of SN 2 in Figure 4.4.a, is a multiple interface. 

More precisely stated, the pre- and the post-conditions of the interface Q 2X have two 
transitions: {£ 3 , £ 4 } and {£ 5 , £ 5 }, respectively. The input interface leading to SN 2 is 
defined through a single place p^. Thus, the cut is SIMO with respect to SN 2 . On 
the other hand, if SN X is taken as the reference subnet, the same cut is Multiple 
Input/Single Output (MISO, the symmetrical case). Therefore, in the rest of this 
work we refer to these types of cuts as SIMO cuts. 

Definition 4.4.1 £e££ be a single cut producing subnets SN X> and SN 2 , £ is SIMO 
with respect to SN 2 ( MISO with respect to SN X ) if: 

I\ 2 : 3£i2 € T X :Q X2 C £* 2 or 3£ 12 € T 2 :Q X2 Q*t 12 {single interface} 

-^21 • ^£21 € T 2 :Q 2x C £ 21 and fit 2x € T X :Q 2X C* £ 21 { multiple interface} 

Comparing Definition 4.3.2 and 4.4.1, it is clear that a SIMO cut simply 
negates the second condition of the SISO cut. Thus, SISO and SIMO are disjoint 
classes of cuts. 
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(a) Original net system and cut 



(b) Introducing the shadow transition t s associated to the cut £ 



Figure 4.4: SIMO Cut and Aggregated Net Systems 

We already know how to deal with the SISO case. Thus, an idea is to transform 
the underlying net structure in such a way that the resulting subnets have SISO 
characteristics. The necessary transformation on the net structure leads to the 
introduction of a single immediate shadow transition ts- 

Definition 4.4.2 (shadow transition) Let Q l} be the set of multiple interface 
places generated by a cut £. The duplicate interface places Rij, and the shadow 
transition t s are generated as follows: 

Vqij € Qij, * r'j = Vij 

t « l • 

3 lJ ~ Qij 


Figure 4.4.b shows the introduction of a shadow transition t s leads to the 
duplication of the interface {ps,pr} creating the places {r 5 ,r 7 } and {s 5 ,s 7 }. Then 
we have: 

ts = { r 5i r 7}» tg = { 35 , S 7 } 

The cuts defined now through q l2 = {pa} and Q 21 = {r 5 ,r 7 } or Q 2 1 = {s 5 , s 7 } 
are SISO. From a structural point of view, the introduction of a shadow transition 
means the introduction of a new synchronization constraint. In general, the presence 
of the shadow transition creates new elementary cycles. In our case: 

( Q ) Pz — ► tz r 7 ts S5 —* t 5 — > p 6 — *• t 6 — *• p s — ► t 2 — *■ p 3 

(/?) Pz tz —* Pa t 4 — *• r 5 —*■ t a —* s 7 -+ f 6 — ► p s — > t 2 —*■ p 3 

axe new cycles. They lead to the following P-in variants: 

(a) M{p z ) + M(r 7 ) + M(s s ) + M(p 6 ) + M(ps) + M{p 3 ) = K a (M 0 ) 

(/?) M(pa) + M{p A ) + M(r 5 ) + M[s 7 ) + M(p s ) + M(p 3 ) = K 0 (M O ) 

where A a (A/o) and I\ 0 (Mq) is the number of tokens in the newly created P- 
in variants. 

Using the standard reduction approach on the SISO cut in Fig 4.4. b, the 
aggregated nets in Fig. 4.4.c are obtained. Observe that both can easily be derived 
from the original net. In our presentation we introduce the transformed net with 
the shadow transition to highlight the presence of the additional synchronization 
constraint. 

Now we have to choose appropriate initial markings for the aggregated nets 
(Fig. 4.4.c). To better understand the problems, let us go back to the original net 
system, and the transformed net with the shadow transition. One solution could be 
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derived by maxi mizing the number of tokens in the places of the multiple interface 
of the original net system (i.e., {ps,??})- Such a solution is, for example: M T = 
(1,0, 0 , 0 , 2 , 0,1, 0,2,0). Observe that M{p $ ) = B(p s ) = 2, M(p 7 ) = B{p 7 ) = 1 and 
M(p 3 ) = 0. 

The following property formalizes the previous discussions: 

Property 4.4.1 Let (jV, Mf) be a live MG system, f a SIMO cut with the single 
interface q X 2 and the multiple interface Q 21 • Any solution of the marking of the 
multiple interface places: 


max EpeQn M(p) 

subject to M — Mq + C • 0 >0 (LPP4) 

a > 0, M >0 

is such that Vp € Q 211 then M(p) = B(p) and — 0. 

With a solution of LPP4, the number of tokens in the places belonging to the mul- 
tiple interface is their marking bound. 

Proof: It is easy to see that any elementary circuit through q x2 should go through 
one and only one place of the interface Q21 • Thus, reasoning on the P-invariants gen- 
erated by these circuits and considering the reachability theorem (Theorem 3.4.3), 
it can be concluded that maximizing the marking on Q 21 leads to M(q x 2) = 0. 

On the other hand it can be proved that there exists no elementary circuit 
containing two (or more) places of Q 2 V from p € Q 21 we should reach <712 before 
returning to any place of ^21* Thus, a circuit through two places of Q 21 should go 
twice through q X 2 and the circuit is not elementary. 

The elementary P-invariants generated by the circuits containing places of Q21 
have one and only one place of Q21 (i- e -j they do not interfere). Thus, maximizing 
the sum of tokens is equivalent to maximizing the number of tokens in each place 
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of Q 2 1- The result follows because of the reachability theorem (Theorem 3.4.3). ■ 

Once a solution of (LPP4) has been computed, the problem of where the tokens 
should belong to once the structure of the net is transformed arises! 

• Case a. tokens into £5, i.e., into the pre-set of the shadow transition, 
M>(rs) = 2 , M 0 (r 7 ) = 1, M 0 (s s ) = M 0 (s 7 ) = 0. 

• Case b. tokens into t^, i.e., into the post-set of the shadow transition, 
M 0 (r s ) = M 0 (r T ) = 0, M 0 (ss) = 2, M 0 {s 7 ) = 1. 

In the original net system we have the following marking bounds: 

B( Pi ) = B( Ps ) = 2; B(p 3 ) = B( Pt ) = B( Pa ) = 1 . 

After the computation of the new markings we have the following changes in the 
marking bounds: 

Case(a), tokens in pre-set of t $ : 5(s 5 )=l , s s € SN\ 

Case(b), tokens in post-set of ts : f?(p 4 )=l, p 4 € SlV 2 . 

Because of the shadow transition, in both cases the behavior of the resulting net 
system is different from the original. A closer examination shows that computing 
the marking according to (a), we only preserve the marking bound of the places 
in SN 2 . On the other hand if we compute the marking according to (b) only the 
marking bounds of the places in SNi are preserved. 

In case (a) ((b) respectively) the initial marking for the transformed net with 
the shadow transition preserves the bounds of the places belonging to SN 2 ( SN t ) if 
tokens corresponding to the original interface places Q 21 are deposited in the pre-set 
(post-set) of the shadow transition, *t a = {r} (t* = {5}). 

For the computation of the two aggregated net systems and the basic skeleton 
system we do not need to introduce the shadow transition explicitly (after all it leads 
to an increase of the vanishing state space). The process leading to the aggregated 
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net systems and basic skeleton system can be expressed directly by means of the 
following decomposition rule. In this way, all the marking bounds are preserved. 
The effect of its application is shown in Fig. 4.4.C. 


Rule 2: Multiple Interface Decomposition Rule 

1. Structure: 

(a) A/Vi = (P„ T„ Fi) i = 1,2 is an aggregated net, where: 

• Pi = Pi 

• fi = Ti U {rj} 

• Fi = Fi U (Qij x Tj) U (t : x Qjt), (=* *Tj = Qij, tj • = Qji) 

(b) The basic skeleton is just the cycle {^12 — >■ r 2 — >• 921 + * ?i2}> where 

qij now represents the subset of places Qij. 

2. Marking: 

Let ~M be a solution of the following linear programming problem on the 
original net system (a generalization of (LPP 3 )): 


~M = maximize E21 • M 

subject to M = Mo + C • a > 0 
a > 0 


(LPP 5 ) 


where £ 2 1 is the characteristic vector of the multiple interface, Q 2 1, i.e.: 


•^21 (?) — 


1 if p € Q21 
0 otherwise 


The initial marking of AjVi, i = 1 , 2 , is 1 V/q, the projection of M on AV"i, i.e., 
M'q(p) = M(p) Vp € Pi, considering that places r k and s k represent p k of the 
original net. The initial marking for the basic skeleton is: 

Mo(qn) = 0 and M 0 (q 2 1) = = 
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The multiple interface reduction rule is a generalization of the SISO rule (Rule 
2). For SISO interfaces both rules lead to the same transformation. We can identify 
the projection of the marking vector on AN\ as case (a) (tokens in *t 5 ), while the 
projection on AN \ corresponds to case (b) (tokens in t§). 

Property 4.4.2 Let (N,M 0 ) be a live MG system, £ a SIMO cut on N, and 
(^m’i -Mo); 1 = 1>2, the partially aggregated net systems obtained by applying the 
Multiple Interface Decomposition Rule, then: 

1. B(p ) = Bi(p) Vp 6 Pi. 

2. {AN i, M-q), i — 1,2 is live. 

Proof: 

1. Any circuit of AN ,• is the projection of a circuit of the original net (observe that 
all the new circuits introduced by the shadow transitions disappear with the 
subsequent reduction processes). The result follows because of the reachability 
theorem and the P-invariants induced by the circuits. 

2. The bounds being preserved, all the circuits of the aggregated nets should be 
marked. Therefore, the aggregated net systems are live. ■ 

In order to approximate the throughput, the initial value computation and 
iterative schema of the SISO case (see Algorithm 1, steps 3 and 4) can now be 
used with the two aggregated net systems and the basic skeleton system. Now the 
number of customers K = B(qi 2 ), i.e., K is given by the marking bound of the 
single interface place, g 12 . 

It can easily be verified that the two aggregated net systems obtained with the 
Multiple Interface Decomposition Rule can be derived from the same transformed 
net (with the shadow transition, Figure 4.4. b) but for two different initial markings 
(i.e., two different net systems). 


59 


4.4.1 Alternative Interpretation of the SIMO cut 

Another way of interpreting the SIMO decomposition is as follows: Is it possi- 
ble to reconstruct the original net system from the two aggregated net systems AS\ 
and AS 2 obtained with the Multiple Interface Decomposition Rule? As we will we 
see not, because we have made a structural change by introducing the shadow tran- 
sition. It is possible to construct a net system where a SISO cut leads to the same 
aggregated net systems as with the SIMO cut. Let the initial marking of the net 
shown in Figure 4.4.b be M(s 7 ) = M{r$) = M{p\) = 1 and M(s$) = M(pf) = 2. 
The SISO cut through the places s 7 ,s 5 and p 3 together with the canonical trans- 
formation also leads to to the aggregated net systems shown in Figure 4.4.C but its 
behavior is completely different from that in Figure 4. 4. a. Here we have introduced 
the shadow transition and added tokens until the marking bound of all places is the 
same as in the corresponding places of Figure 4.4.a. 

Theorem 4.4.1 Let (j\f, iV/ 0 ) be a MG system with the SIMO cut £ defined through 
the single interface and the multiple interface Qji and W 7 the net structure after 
introducing the shadow transition. The SISO cut £* through {j\f , M 0 ) with the 
interface places R : i or Sji produces the same aggregated net systems as the cut { 
in (jV, M 0 ) . Mq is given as follows: Let M(p) be a solution of LPP5. Mq is the 
projection of M{p) on P' considering that Rji now represents Qji- Furthermore, let 
k be the minimum number of tokens in the interface Rji, i.e., 

k = min M(p) (4.7) 

The marking o/5 ;i - is given by 

V^eSji. Af(4) = Mir',) - k (4.8) 

Proof: We have to show that the marking bound of the places in (A/*', Mq) is the 
same as in ( jV , Mo) ■ 
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1. The bound of places in SNj cannot have decreased because we have at least 
maintained the number of tokens in each directed circuit by using the projec- 
tion of the marking vector of LPP5 (i? Jt is now part of SNj). 

2. The maximum number of tokens in a directed circuit in SNj is the same 
because there exists at least one path in SN{ (between q i} and Sji) which is 
empty. By using corollary 3.4.4 we have not increased the marking bound of 
the places in SNj. 

The same reasoning holds for SN { , because by firing t 3 the situation of SN : and 
SNi is now symmetrical and the same argument holds. ■ 

The above reasoning is conceptual, because the aggregated net systems can 
be obtained directly from the original net system by elementary net reduction rules, 
specifically the macrotransition reduction rule (the reverse-dual of the macroplace 
reduction rule presented in [Sil85]). Nevertheless, when interpreting the results gen- 
erated by a SIMO cut, it is important to remember that a structural transformation 
has taken place. 

Because of the shadow synchronization constraint (a pessimistic transforma- 
tion) and the change in marking by Theorem 4.4.1 (an optimistic transformation), 
the global estimated performance may be pessimistic or optimistic. 

4.5 Multiple Input/Multiple Output (MIMO) Cut 

The third (and last) class of single cuts is obtained when the places of the 
input and the output interface do not belong to the pre-set or the post-set of a single 
transition. In other words, several (multiple) transitions define both interfaces. 

Definition 4.5.1 Let £ be a single cut producing subnets SN X and SN 2 , connected 
through the subsets of interface places Qu,Q 2 i C P. £ is MIMO if: 

1 12 •' A 2 € T X :Q U C 2 and jSt l2 € Ti'-Qn C*£ 12 
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Figure 4.5: A Bad MIMO Cut and Two Possible Interpretations of the Net System 
Transformation 

7 21 ; jBtn € T 2 :Q 21 C f 2l and € T X :Q 21 CU 21 

Comparing Definitions 4.3.2 and 4.5.1, it is clear that a MIMO cut negates 
both conditions of a SISO cut. SISO, SIMO and MIMO form a complete set of the 
disjoint classes of single cuts. 

MIMO cuts can be much more problematic from a performance analysis point 
of view. In general, if we apply the technique presented previously, we do not 
maintain the meaning of the original net system. We will illustrate the previous 
statement with two examples. 

The net system shown in Figure 4. 5. a (without place tt) is a monomarked 
(i.e., marked with a single token) elementary circuit. The MIMO cut presented 
is bad from any point of view: by making a structural transformation analogous 
to that introduced in Section 4.4 (i.e., introducing two shadow transitions, one per 
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interface), the net in Figure 4.5. b is obtained. New circuits have been introduced into 
the net system, which is now obviously dead. Adding a token to p 3 , the net system 
becomes effectively live, but the behavior of the transformed system has nothing 
to do with that of the original system (e.g., compare the possible firing sequences). 
Another interpretation for the reduction process is illustrated in Figure 4.5.c where 
the behavior (taken as the firing sequence) is preserved, the only difference being 
that instead of firing a or d, an aggregated transition ad is fired. There are two 
problems associated with the second interpretation of the aggregation process: 

(1) The aggregated net depends on the behavior of the original system and 

(2) The aggregated net is not an MG. 

What is the problem with the MIMO cut in Figure 4. 5. a? The answer is quite 
simple: an elementary circuit has been cut in more than two places (four to be more 
precise, because the number of cuts on a circuit to split a net into two parts must be 
even). Doing that, two subcircuits of the same circuit axe forced to be aggregated, 
something that is functionally meaningless. 

In another way the MIMO cut of Figure 4.5.a is also problematic: we are “ag- 
gregating” two disconnected subnets pi,t a ,p 4 and pi,t d ,p 5 and from a performance 
point of view this may be dubious. Nevertheless, we can add an implicit place t to 
the original net ( V = t a and ir* = tf). Because tt is implicit [CS90], this new place 
does not change the behavior of the original system, and the subnet to be aggregated 
is now connected. But if we reduce the subnet we will change the parallel behavior 
( || t d ) in Figure 4.5.b to a sequential behavior ( t a o f d ). Therefore, two net 

systems (with and without the implicit place k) have identical behaviors, but lead 
to two transformed net systems with quite different behaviors! We can alleviate the 
problem by introducing the notion of well formed cuts. 

Definition 4.5.2 (Well Formed Cut) A MIMO cut £ is said to be Well Formed 
if no elementary circuit is cut more than twice. 
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Given a cut on an MG, it can be easily characterized as being well (or ill) 
formed. Let be the arc weighted graph obtained from the MG considering the 
transitions as nodes and the places as arcs. Arcs are labeled with weight 1 if the 
corresponding place belongs to the interfaces defined by the cut, and 0 otherwise. 
Thus, the following can be concluded: 

Property 4.5.1 The MIMO cut £ is well formed iff the maximum weighted circuit 
of has weight two (i.e.,no circuit is cut more than twice). 

The problem of computing a maximum weighted circuit [Deo74] has polyno- 
mial time complexity, therefore, well formed cuts are characterized in polynomial 
time. 

The transformation of the MIMO cut into a SISO cut leads to the introduc- 
tion of two shadow synchronizations: one for the input and another for the output 
interface. Figure 4.5.b shows the structural transformation. Of course the shadow 
transitions can introduce many new circuits. 

The process leading to two aggregated net systems and the basic skeleton 
system can be directly expressed by Rule 2, Multiple Interface Decomposition Rule. 
The two aggregated net systems produced by f in Figure 4.6.a are represented in 
Figure 4.6.c. The following are two interesting observations on our example. In the 
transformed net with the shadow transitions: 

• Places P4 and rj (representing p \ ) in *4<S| are now 1-bounded, while they were 

2- bounded on the original net system, i.e., now the “number of customers is 

not preserved. 

• Place s 3 is implicit [CS90] and the behavior is not changed if we delete it. 

The definition of the multiple interface decomposition rule emphasizes maxi- 
mizing the number of tokens in the multiple interface. As the cut is now MIMO, 
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SNi «: SN 3 



(a) Original net system 


(b) Transformed net structure 
with shadow transitions 



Figure 4.6: Example for MIMO Cut: Marking Bound Changes 

both interfaces are multiple and a solution for M can be obtained by maximizing 
the number of tokens in Q 2 \ or in Q u - 

The situation is now somewhat more symmetrical, what happens if we maxi- 
mize the marking of the other multiple interface, i.e., E 12 - ~M = H p€ g n iV/(p)?. 

The solution for ~M is now ~M = iV/ 0 , thus: 


• Af(r x ) = 2, M(r 2 ) = 1, M(s 3 ) = M(s 6 ) = M(p 4 ) = M(p?) = M( Pa ) = 
M(p 9 ) = 0. 

• Af(*i) = 2 iV/(s 2 ) = 1, M(r 3 ) = iV/(r 6 ) = M{p s ) = 0. 

The new initial marking for AX r 2 is reachable in AS 2 , therefore, basically 
the situation for AS 2 has not changed. Nevertheless, ,4<Si has an initial marking 
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such that the bound of all places except s 3 is preserved! Moreover, even if B(s 3 ) is 
different from B(p 3 ) it is easy to see that s 3 is implicit [CS90] and can be removed 
without affecting the behavior! In other words, optimizing the marking on Q i2 or 
on Q 2 i has different effects on the behavior of the aggregated net systems! 

Property 4.5.2 Let (Af, Mo) be a live MG system, £ a well formed MIMO cut on 
J\f, and (JV 5 , Mo) the net system obtained applying the multiple interface decom- 
position rule. Then (JV*, M.q) is live, but in general the bounds of places are not 
preserved. 

Proof: By maximizing the number of tokens in the output interface, we guarantee 
that each circuit is marked (the new circuits as well as the old ones, because each 
new elementary circuit has to include one input and one output interface place), 
therefore, the transformed net system is live. ■ 

Anyhow, approximate quantitative results are obtained by applying the ini- 
tial value computation and iteration schema of Algorithm 1 to the aggregated net 
systems and the basic skeleton system obtained with the multiple interface decom- 
position rule and maximizing the marking on Q 21 or on Q 12 • K is now the minimum 
number of tokens in the circuits which have been cut. 

From the above discussion the following property follows: 

Property 4.5.3 The marking bound ofASi, i = 1,2 is preserved, if the number of 
tokens in each cut cycle is the same. 

4.6 Multiple Cuts and Response Time Approximation: Hierarchical De- 
composition 

In Section 4.2 the fundamental ideas for an approximation technique based 
on single cuts have been introduced. Sometimes, the decomposition of the system 
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by a single cut into two subsystems is not sufficient, because we cannot solve the 
underlying CTMC’s of the aggregated subsystems in a reasonable amount of time. 
Once again we have to look for other approaches to cope with the problem of state 
space explosion. 

The basic idea is to consider not only a single cut, but to introduce several 
cuts. We consider that multiple cuts can be implemented in a recursive manner, 
leading to a hierarchy of subsystems which are structured in several levels. 

In this approach we assume that the MG is decomposable into subnets by 
more than one single cut. In general, this assumption is not true, when we consider 
only a very restricted class of cuts like the SISO cut in this work. However, when we 
consider all possible kinds of single cuts (SISO, SIMO or MIMO) this decomposition 
is always possible. 

Figure 4.7 illustrates the idea of the hierarchical decomposition. At each level 
the nets are recursively partitioned into two subnets by a single cut until the size of 
the state space of the resulting aggregated net systems is sufficiently small. Dashed 
transitions represent the aggregation upon which the service times are iterated at 
the current level. The approximate value of the throughput is computed in a hi- 
erarchical manner. The CTMCs associated with the aggregated net systems are 
computed at the lower level of the decomposition when they are small enough (sys- 
tems v4iSi!, AS\ 2 , -4<S2i AS 2 2 in Figure 4.7). 
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At each step of the recursive decomposition, the analyst must define a cut 
for each subnet, and evaluate the new set of aggregated subsystems whether it is 
possible to solve the associated CTMCs in a reasonable amount of time. Since 
an accurate estimation of the size of a CTMC is difficult (although a promising 
approach is outlined in [WD92a, WD92b]), the analyst should define when the 
CTMC is “small enough” to be solved analytically. Moreover, if an additional 
decomposition is needed, the analyst should define the new cut. Note that in this 
decomposition method the number of subsystems whose associated CTMC has to 
be solved is doubled with each decomposition. 

We are now going to describe, in an intuitive way, the hierarchical decompo- 
sition method with the example presented in Figure 4.7. Because of the difficulties 
associated with MIMO cuts, in the following we only consider SISO and SIMO cuts. 

Consider the original system denoted by AT at the top of Figure 4.7. Then 
we choose a cut B through the places pj and pi 7 . This cut satisfies the SISO cut 
conditions. After the first cut is defined, we apply the decomposition process for a 
single cut presented in Section 4.3.2, and obtain the aggregated systems and 
AS\. 

Now we have to determine if the sizes of the state space of the aggregated 
systems *4<Sj and .45 j are small enough. If the answer is affirmative, we can com- 
pute the throughput of the original system with the iterative process presented in 
Section 4.3.3. Note that if this condition holds at the first hierarchical level, we are 
applying the single cut method. 

All systems that are not sufficiently “small” must be further decomposed. This 
means that for those systems we have to choose either a SISO or SIMO cut and apply 
the appropriate decomposition rule. 

In the general case, the number of systems can be large and distributed in 
several levels. The number of systems and the number of levels are related. If we 
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have l systems, at most /—I levels are implicated in the process. An abstract sketch 
of the recursive-interactive function for the quantitative analysis of the general case 
is presented below. All functions have been previously defined in Section 4.3.3. 

Algorithm2 : Recursive — Interactive Analysis with SISO cuts 
procedure solvesystem (net system , level) 

inquire if the net system is “small enough”; 
if small enough then solve the underlying CTMC; 

else if the net system has not been solved before 
then 

request a new SISO cut; 

use the decomposition rule to generate the left and right 
aggregated net systems, and the basic skeleton system; 
endif 
repeat 

solvesystemfleftsystem, level+l); 

solve Jbasicjskeleton; 

solvesystem (rightsystem, level+l); 
until e x < e(level) 

end if 

An important observation on the sketched algorithm is that the require- 
ments of accuracy on the different levels of the recursion can be different. In the 
test-convergence function, we can use a convergence criterion dependent on the hi- 
erarchical level, i.e., e(level). This is interesting in practice because lower levels 
are computed more frequently (they partially determine the time complexity) and 
the impact of less accurate results at lower levels is partially canceled at the upper 
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levels. The initial value problem for intermediate and lower levels is not detailed 
but the practice is to recover the last estimation at the corresponding level. 

Property 4.6.1 (Nlultiple SISO Outs) Let (A/* , Mq) be a live MG system. 

1. For every aggregated net system and/or the basic skeleton system, the bounds 
of the places are preserved: B(p) = B(p), Vp 6 P,-. 

2. Liveness in all aggregated net systems and the basic skeleton system is pre- 
served. 

Proof: Since the SISO decomposition rule is applied at each step of the hierarchical 
decomposition, Property 4.6.1 follows from Property 4.3.2. ■ 

In the hierarchical decomposition, the full environment is recursively defined 
using the notion presented for the single cut. At each level, the environment of a 
subnet contains all information of the environment of previous levels in its branch. 
In other words, the error in the representation of the environment is bigger at 
deeper levels of the hierarchy. So we can say that the accuracy of the hierarchical 
method will be an inverse function of the number of levels in the hierarchical tree 
of decomposed systems. 

Finally, recall that in the hierarchical decomposition, the approximated value 
of the throughput is computed by a recursive procedure. In order to solve each node 
of the decomposition tree, the children of the node must be solved. This means, 
that each iteration at level k implies a new computation of the throughput of all 
children at level k. So the analyst must evaluate the convenience of carrying out a 
new decomposition taking not only the size of the associated CTMC, but also the 
number of times the lower levels are evaluated into account 

4.7 Conclusions 

In this chapter we have introduced the structural decomposition of MGs. It is 
based on a single cut through a single or multiple interface. We have seen that the 
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SIMO and MIMO cuts correspond to a structural and marking transformation. Fur- 
thermore, in the MIMO cut the marking bound of places is in general not preserved. 
This justifies that in the following we concentrate on SISO and to some extent on 
SIMO cuts. It is important to note that the basic skeleton is in product form , i.e., 
we have decomposed the MG system according to Definition 2.3.1 (product form 
decomposition). Based on the aggregated net systems and the basic skeleton, we 
can use both RTA, and any of the other techniques developed for "almost” PFQNs. 

From the stochastic perspective we have introduced RTA, a new approximation 
technique for the case of single cuts. We have simplified RTP , so that in the proposed 
method the computation of the coefficient of variation is not required. RTA can 
be performed on a single level or can be used in conjunction with a hierarchical 
decomposition. 


CHAPTER 5 


Alternative Approximation Methods for Marked Graphs 
5.1 Introduction 

In the previous chapter we have explored the structural decomposition of MGs 
to derive the aggregated net systems. Based on the structural decomposition, we 
now present alternatives to RTA. In Chapter 4, we have presented Response Time 
Approximation (RTA), which is specifically designed for the single cut. In the 
following, we present Flow Equivalent Aggregation (FEA) and Marie’s method. 
Both are well known approximation methods for QNs, which are here applied in the 
context of MG systems. They derive from the fact that by decomposing the MG 
system by a single cut, the basic skeleton is in product form. 

Delay equivalence (DE) [LW91, WL91] is another approach for the iterative 
solution of MGs. Contrary to the previous approaches, it does not make use of the 
basic skeleton. 

In this chapter, we present the approximation methods. In Chapter 6, all of 
the presented methods are compared with several examples. 

5.2 Flow Equivalent Aggregation 

In MGs, the basic approach is to substitute subnet 57V,- by a flow equiva- 
lent transition (FET) t,-, whose (state dependent) firing rate is characterized by the 
number of tokens in its input place qji (i.e., the queue length of the flow equiv- 
alent server). After the aggregation phase, both subnets are substituted by their 
respective FETs (tj and T 2 ). From a structural point of view, this leads to the 
basic skeleton. The overall performance analysis is done by solving the underlying 
CTMC. 
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5.2.1 Qualitative Aspects 

The qualitative aspects of the decomposition have already been addressed in 
Rule 1: SISO Decomposition Rule (section 4.3.2) and Rule 2: Multiple Interface De- 
composition Rule (section 4.4). Based on a single cut, we generate both aggregated 
nets. AS-l (AS 2 ) will be used to analyze 5iV x ( SN 2 ) under a varying load which 
allows us to formulate the flow equivalent transition t x ( 7*2 )• 

5.2.2 Quantitative Aspects 

In FEA, the subnets are analyzed in short circuit. This corresponds to setting 
r x in AS 2 (f 2 in A5 X ) to an immediate transition. Alternatively we can merge the 
interface places and delete r,- (this is preferred, because it reduces the number of 
vanishing markings). For the quantitative analysis, the number of tokens k in the 
interface place q 2 \ has to be varied up to the maximum allowed, i.e., k = l . . . K, 
where K is the initial marking of q 2 1 and computed by solving LPP3. 

Algorithm 2: Flow equivalent analysis of single cuts 

1) Select a SISO cut £ 

2) Generate the aggregated net systems *4<S x and A<S 2 in short circuit by merging 
the interface places as well as the basic skeleton system using the Decomposi- 
tion Rule (SISO or Multiple Interface). 

3) Analyze AS i} i = 1,2 with m 0 (g 2 i) = /, I = 1 ... /^ by solving the underlying 
CTMC and obtain the conditional throughputs X x and X 2 . 

4 ) Set the service rates of the aggregated transitions to the conditional through- 
put, i.e., 


*(o = m 


i = 1 , 2 ; l = l...K 


(5.1) 
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5) Analyze the basic skeleton with the transition rates obtained from step 4. 

The above algorithm yields the maximum, reduction of the state space (higher 
than in RTA), but as we will see the accuracy can suffer. Alternatively, we can 
aggregate only one of the subnets (e.g., SN X ) and substitute the state dependent 
rate of p x into AS 2 . The state space will be higher, but in general the quality of 
the approximation will be better. 

5.2.3 Flow Equivalent Aggregation vs. Response Time Preservation 

In FEA the assumption is that the service time depends only on the number of 
customers which are currently present in the subsystem. In FEA the behavior of the 
subsystem is assumed to be independent of the arrival process and depends only on 
the number of customers in the system, i.e., the behavior is completely independent 
of the environment. This assumption is violated in several cases. 

Let us illustrate the fact that even in very special cases the mean completion 
(or traversing) time (MCT) may depend on the interarrival process of the token. 
Informally, the MCT is the average time spent by a token traversing the subnet. 
Figures 5.1(b) and 5.1(c) show two 1-bounded MG systems. In the first (second) 
we assume r 2 (r x ) to have different rates. Figure 5.1(b) essentially represents, the 
circuit (pi - t a - - t c - p 3 - r 2 ) in which “the server” at t a has a setup time 

modeled by Figure 5.1(c) is essentially a cyclic fork-join system. 

The system in Figure 5.1(b) can be interpreted as a M/G/l/l queue where 
r 2 represents the exponential server, while SN X , represents the general server. Note 
that the service time of G is just the MCT of SN X (a similar argument holds for the 
system in Figure 5.1(c)). The cycle time in an AI/G/ 1/1 queue is simply the sum 
of the service times of M and G ( insensitivity ) [GH85]. The MCT of SN, r SlV , is 
therefore given by: 


r 5 .v = r — o 


(5.2) 
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?21 


(d) Basic Skeleton 

Figure 5.1: Aggregated Net and Basic Skeleton Systems Obtained from the Cut £ 
( Service Times: s a = 0.33, Sj = 1.0, s c = 0.25, sj = 0.20, s e = 0.33, s g = 0.5) 


where T is the mean interfiring (cycle) time with respect to any transition, and 9 
denotes the firing (service) time of any transition r, 9 = s n — s-^. 

Table 5.1 presents the numerical data of the analysis of the systems in Figures 
5.1(b) and 5.1(c). Both systems are 1-bounded, thus the variation in I^at, (from 
1.383 to 0.583) can be explained only due to the dependency between the general 
(time dependent, in particular) and the exponential server. If 9 is very large ( 9 
10), tb will almost surely fire before the firing of r 2 ends. Thus, T = s a + -s c + 9 
and TsNx = s a + s e - If 9 is zero (i.e., r 2 is immediate), then Fsa/, will be much 
higher because now it depends on, sj, the firing time of plus s c . On the other 
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System 

in Fig. 5.1(b) 

System in 

Fig. 5.11(c) 

II 

Co 

II 

Co 

3 

T b 

Ts/v, 

r c 

r 5 ^ 

0 

1.383 

1.383 

1.783 

1.783 

0.1 

1.411 

1.311 

1.883 

1.783 

1.0 

1.983 

0.983 

2.783 

1.783 

2.0 

2.850 

0.850 

3.783 

1.783 

10.0 

10.656 

0.656 

11.783 

1.783 

» 10 

9 + 0.583 

s„ + s c = 0.583 

9 + 1.783 

1.783 


Table 5.1: Dependence and Independence of the Mean Completion Time on the 
Interarrival Time 

hand, the value of Tsn^ (Table 5.1) is constant, i.e., for the given initial marking 
there exists a perfect decoupling (independence) between the general (in this case 
time independent) and the exponential queues. This is easy to explain because the 
1-bounded system is insensitive to the pdf of the server: both servers act as delay 
nodes. 

The case of Figure 5.1(b) is just an illustration of the fact that the existence of 
internal loops in a subnet may violate the independence condition. Another situation 
where the independence condition can be violated is when trapped tokens exist in a 
fork- join. To illustrate this case, assume a token is added to p 4 in Figure 5.1(c). 
Even if p 4 and p$ are now 2-bounded, 512 Is 1-bounded and the general server " works 
as with a single customer. Therefore, insensitivity in the M/G/ljl queue remains 
and Eq(5.2) is valid. Nevertheless, with the modified marking the MCT of SN 2 is 
no longer independent of 9 (see Table 5.2). 

For the particular case of the examples (1-bounded exponentially timed MG 
systems), it is easy to observe a monotonicity property: increasing the service time 
9 (representing a complementary subsystem) never leads to a bigger MCT for the 
subsystem under consideration (see Tables 5.1 and 5.2 for and respec- 

tively). This is easy to understand: the strong dependency between the general 
and the exponential servers is due to the fact that the general server (i.e., the one 
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9 = 3^= 1 / Pi 

r c 

Tstv, 

0.25 

1.556 

1.306 

0.5 

1.757 

1.257 

1.0 

2.198 

1.198 

2.0 

3.141 

1.141 

4.0 

5.097 

1.097 


Table 5.2: rsw 2 Depends on 9 for the System in Figure 5.1(c) if a Token is Added 
to p 4 

summarizing SN ,) is time-dependent. But as MGs are conflict-free, the increase of 
the interarrival time of tokens eventually allows the tokens in the subnet to progress 
towards the output places (i.e., q l2 for SNi and q 2 1 for SN 2 ), resulting in a smaller 
or equal (but never larger!) MCT. 

Because of the insensitivity of MjG/l/l queues, the estimated values of the 
cycle time T in FEA, (r fea) a ^e as follows: 

(a) Figure 5.1(b): = 9 + 1.383 (ft T b , Tables 5.1) 

(b) Figure 5.1(c): V C FEA = 9 + 1.783 (= T c , Table 5.1, because of the indepen- 
dence) 

If the FEA is performed on SN\ for the marking in Figure 5.1(b), for 9 = 2, 
an error of 18.7% in the computation of the throughput is obtained. 

In response time preservation (RTP) [ABS84] the response time of the subsys- 
tem is replicated. Here the assumption for the analysis phase is a Poisson arrival p of 
customers representing the environment of the subsystem. During the aggregation 
phase the RT is replicated by an exponential server, i.e., 

RTsw(p) — RTan(p) ( 5 - 3 ) 

The advantage of iterative methods lies in the fact that it can model the inter- 
action of subsystems. The drawback is that convergence might eventually present a 
problem. 
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rr. ~ r>» 

" H Subnetwork R * 1 m 

Figure 5.2: Open Subnetwork R l 0 

5.3 Marie’s Method 

The first step in Marie’s method is the Product Form Decomposition of the 
Petri net according to Definition 2.3.1. For a SISO cut in a MG system, these 
conditions are obviously satisfied. 

Let R l be the /- th subsystem with n l customers present, A ! (n‘) the state de- 
pendent arrival rate and K the total number of customers. The open system in 
isolation is denoted by R l Q as shown in figure 5.2 and is analyzed in isolation under 
a state dependent Poisson arrival process. 

Suppose that A'(n') is known, then R l 0 can be analyzed. Note that although 
the system is nominally open, the arrival rate to the system is zero, once K customers 
are in the system. For the case of a single cut in MGs, the closed system has the 
structure of an aggregated net system. Now the transition r,- (with firing rate A'(n')) 
is the generator of the arrival of customers to SiV,-, i ^ j. 

Particularly, we are interested in the steady state probability Pr{n l ) of haring 
n customers in the open subnetwork. Then the conditional throughput can 

be- obtained as [Mar79]. 

v‘ M (n l ) = A'(n' - l) ^LZyl for n l = 1, ... ,K (5.4) 

As in FEA, the service rates p l (n l ) are set to the conditional throughput. 

/zV) = (5.5) 

The problem is now reduced to finding the state dependent arrival rates A^(n^) 
for the open subnetwork Rq. To this end, the partitioned system (now in product 


^Subnetwork R^ 
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^Z'' • • • 

^J® 

Ml M2 Mw 

Figure 5.3: Birth-Death Process 

form) is analyzed with the rates obtained from equation (5.5) (either by using mean 
value analysis [RL80] or simply the analysis of the CTMC). 

A l (n , ) = <. , (n' + l) f p( n t ) 1) ' n‘ = 0,...,K-l (5.6) 

Having computed the state dependent arrival rates the iteration can be re- 
peated. 

An alternative interpretation of Marie’s method for single cuts in MGs is the 
following: We analyze AS,- with a state dependent arrival rate A(n), n = 1 ... K 
and obtain the conditional probabilities of having n customers in the subsystem. 
We then interpret AS, as a general Markovian birth-death process as illustrated in 
Figure 5.3. 

Here, the states represent the number of customers in SNi. In Figure 5.3 the 
state dependent arrival rates A(n) (birth rates) are known, while we are interested in 
computing the state dependent service rates n(n) (death rates). From the underlying 
CTMC of AS,, we can compute the conditional probabilities of having n customers 
in SNi. 

By formulating the local balance equations on the CTMC of the birth-death 
process in Figure 5.3 at tu-th station (flow in = flow out): 

A(u; - l)Pr(w - 1) = fi(w)Pr(w), w — 1 ... K (5.7) 

From eq. (5.7) immediately follows 

W = l --- K 


(5.8) 
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which is just eq. (5.4) in slightly different notation. fi(w) is going to be the generator 
for the arrivals of ASj, i.e., it denotes the conditional throughput of SN{. 

As with many iterative methods, the uniqueness of the solution cannot be 
proven although numerical experience has shown that a unique fixed point does 
indeed exist, although convergence sometimes presents a problem [BD90]. 

For the case of K = 1 (independence of M/Gf 1/1 queue), equation (5.4) and 
(5.6) are equivalent to equation 4.3 for RTA, i.e., for K = 1 both methods are 
equivalent. 

5.4 Delay Equivalence by Y. Li and M. Woodside 

The work of Li and Woodside [LW91], presents an alternative approach to 
compute the approximate throughput for stochastic MGs. In their work, the orig- 
inal system is also split into subsystems and a delay equivalence (DE) criterion 
is used for throughput approximation. Their service rates for aggregated subsys- 
tems are marking dependent. In a very recent work [WL91], the authors present 
an application, while the computed service rates are made constant in order to get 
acceptable robustness. 

Consider the two aggregated net systems in Figure 5.4 (called auxiliary systems 
in Woodside’s paper). In ASi, a customer suffers two delays as he cycles through 
the system: a delay in 5 N\ itself (D x ), and a delay through the aggregation of SN 2 
(d 2 represented by q x2 and r 2 with firing rate fi 2 ). Similarly d x and D 2 represent the 
delays in AS 2 (note that d represents the delay in the aggregate transitions r, while 
D represents the delay in the unaggregated subsystems, i.e., SN). 

Let I< be the customer population (i.e., the number of tokens in the basic 
skeleton), Efoy] the expected number of tokens in place q {j , X, the throughput of 
AS,- and n i(l) the state dependent firing rate of r, when l customers are present in 
its input place The delays can obtained by Little’s law: 
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Figure 5.4: Aggregated (Auxiliary) Net Systems 


A 

d 2 

A 

d i 


K — E[<7i 2 ] 

- EfeiJ 
* 2 
E[?2l] 


( 5 . 9 ) 

( 5 . 10 ) 

( 5 . 11 ) 

( 5 . 12 ) 


The key to Woodside’s method is, that in steady state the delay a token suffers 
in the subnet is the same as in the aggregate. For example, the delay of a customer 
in SNi of ASi should be the same as in q 2 i and r t in AS 2 . Therefore 


A = d x 
D 2 — d 2 


( 5 . 13 ) 

( 5 . 14 ) 


5.4.1 State Dependent Firing Rates 


Let us first consider the case when the aggregated transition has a state de- 
pendent firing rate [LW91]. The basic approach is to first compute the equivalent 
delay for / = 1 (l//x(l)) where / is the customer load. This information is used to 
compute l/n(2) and successively up to 1 /p(K). 

(A) Case: K — 1 
Here, 


Efoj 1 

Xi 


( 5 . 15 ) 
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(from Little’s law, X\ 

= P 2 E[q u }). 

Now with d 2 

= D 2 and 

writing the equation in 

iterative form, 






„(r+i) 

x\ r) 

1 

(5.16) 


^2 = T 

- Eta,]!') “ 

D s 2 N ' (r) 

similarly for p[ r+1 \ 


xj r) 




„(r+l) 

1 

(5.17) 


Pi = - 

- E[? 12 ]fr> " 

Df N ' {r> 


Property 5.4.1 For the particular case of a single customer, delay equivalence and 
response time approximation are equivalent. 


Proof: Let us consider AS 2 at a given iteration: We know X 2 and pi and we have 
to determine p 2 . We can either solve equation (4.3) or simply write: 


1__ J_ + J. 



*2 Pi + P 2 

(5.18) 

rearranging terms yields 

X 2 pi 
P2 = v 

Pi — X2 

(5.19) 

We have to show that equation (5.16) and (5.19) are the same, i.e., 



Ml __ 

Pl — X 2 1 — E[^2l] 

(5.20) 


From [GH85], we know that the steady state probability of an M/G/c/c and 
an M/M / c / c queue are the same if the means of the servers are the same. Therefore, 
we can write E[qio] and X 2 as if they were obtained by solving an M/M/ 1/1 queue. 


= 


A*i + P2 


and X 2 = 


Pi + P2 


(5.21) 


By substituting eq. (5.21) into (5.20), the equality follows. 


Based on the previous results, the following property holds: 

Property 5.4.2 Because of the independence of the M/l/G/l/l queue, for the case 
of a single customer (I\ = 1), RTA, Marie’s method and delay equivalence yield the 


same results. 
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(B) Case: K > 1 

In the previous steps, the firing rates of r for n = 1 . . . L — 1 customers have 
been determined. The present step is concerned with finding n(L). 

Substituting X x = E^i ^(n) ■ Pr[m(g 12 ) = n] into equation (5.10) 


d, = 


E[?12] 


Ti (I) • Pr(m(? 12 ) =1] + (n) " Pr[m( 9l2 ) = n] 

As before, set d 2 = D 2 and by using eq. (5.11) the iterative equation follows: 

1 


(5.22) 


H 2 (L) = 


Pr (r) [m($ 12 ) = Z] \L 


E Jff. 2 - -T - J2 vM • Pr (r> Mgi2) = n l) ( 5 - 23 ) 

"" & mil n=l / 


Similarly, /Xj r+1) ( L ) is given by; 

1 / 


E (r> [?2l] 

(*->f 


( r + 1 ) / r \ 

Ml W = 


L-l 


— ^i(^) - Pr (r> [m(qr 21 ) = rz] ] (5.24) 


Pr (r) [m(g 2 i) = £] \ ^ — E c > [^ 12 ] n =i 


The iteration has to be performed for L = 1 - . . K • 

Note that for the computation of tx 2 +1) (N) the quantities Pr[m( 5 12 ) = n], n = 
1...K and E[g 12 ] are computed with AS X while E[g 2i ] is computed with AS 2 (sim- 
ilarly for At the current iteration, quantities from the analysis of both 

subnets are used. 


5.4.2 State Independent Service Rate 

We now examine DE for the case of state independent firing rates for the 
aggregated transitions [WL91]. 

Substituting X x — n 2 • (1 — Pr[n 2 (<?i 2 ) = 0]), into equation (5.10) 


, Efo] 

2 “ /i 2 • (1 - Pr[m(?i 2 ) = 0]) 

As before, set d 2 = D 2 and by using eq. (5.11) 

K - Efei] E(g 12 ] 

X 2 p 2 • (1 - Pr[n*(?i2) = 0]) 


(5.25) 


(5.26) 
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Rewriting eq. (5.26) in iterative form yields 


m5 +1 = 


E r [gn] 

I< - E r [g 2 i] (1 - Pr r [m(? 12 ) = 0]) 




Similarly 


r+1 _ E r [g 2 i] 


/V = 


X{ 


(5.27) 


(5.28) 


I< - E r [M (1 - Pr r [m( 9 2i) = 0 ]) 

As for the case of state dependent firing rates, quantities from the analysis of 
both subnets are used. 


The iterative process is now straightforward. The structural decomposition is 
the same in all methods, only the computation of the new firing rate changes (step 
5 in Algorithm 1: Quantitative Analysis of SISO cuts). 


5.4.3 Comparison of Response Time Approximation and Delay Equiva- 
lence 

In order to compare the two methods, it is useful to examine the solutions 
when the iteration has reached steady state, i.e., X x = X 2 . From (5.10) and (5.11), 
we have 


EM = K - E[?2i] (5.29) 

and the first term of eq. (5.27) disappears. In order to interpret the results, we will 
indicate with superscripts how the quantities are computed, e.g., X{* Sl indicates 
that X x is computed by solving the underlying CTMC of AS X . Using equation 
(5.29), (5.27) simplifies to: 

Xfs* 

(1 “ Pr[m(? 12 ) = OP*) ^°‘ 30 ^ 

In steady state, the above formula also holds for RTA, except that Pr[m(j 12 ) = 
0] is computed based on the basic skeleton, while DE uses Pr[m(? 12 ) = 0] as com- 
puted with ASi. This derives from the fact that in RTA we reduce the subnets to 
the basic skeleton. 


i 
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use of BS 

use of AS i ,2 

Iterative 

Information used 

RTA 

X 

X 

X 

Throughput 

Marie 

- 

X 

X 

Marking probabilities 

Delay 

Equiv 

“ 

X 

X 

Throughput and 
Marking probabilities 

FEA 

X 

X* 

' 

* Throughput as a function 
of number of customers 


Table 5.3: Comparison of the Stochastic Approximation Methods 


5.5 Conclusions 

FEA is computationally efficient, although its accuracy may be low due to the 
previously outlined problems. Its accuracy can be improved by aggregating only 
one subnet to a flow equivalent transition. 

It has been shown that Marie’s method can be used in the context of decom- 
position of MGs by a single cut because the basic skeleton is in product form. 

DE works directly with the delay in the paths and does not reduce or make the 
abstraction to a basic skeleton. In this thesis we use a response time approximation 
based on the throughput of the aggregated net systems, i.e., we reduce our system 
to the basic skeleton (an M/M/l/K queue). In steady state, the results using both 
methods are different. 

For the case of a single customer, RTA, Woodside's method and Marie’s 
method yield the same result due to the independence of the M/G/l/K queue. 

Table 5.3 summarizes the characteristics of the four presented methods. 




CHAPTER 6 

Examples for Marked Graph Systems 

6.1 Introduction 

In this chapter we give several numerical examples for the SISO and SIMO 
cuts introduced in the previous section. We compare the results obtained with 
single level and hierarchical Response Time Approximation (RTA), Flow Equivalent 
Aggregation (FEA), Marie’s method and delay equivalence. 

6.2 Fork/ Join Queueing Network 

The net system on top of the hierarchy in Figure 6.1 shows a Fork/ Join QN 
with blocking. The exact value of its throughput is 0.22866, computed with the 
underlying CTMC which has 33480 states using the tool SPNP [DBCT85], 

6.2.1 Single SISO Cut A 

We define the SISO cut A through p 4 and ps (not shown in Figure6.1). We 
now follow the steps for the analysis outlined in Algorithm 1. 

(l.a) The net is partitioned (Fig 6.1) by means of the canonical SISO cut £4 through 
p 4 and p 5 . ASi is defined by the transitions left of the cut, while AS 2 is defined 
to the right of £4. 

(l.b) A solution for LPP3 (Section 4.3.2) is: 

M (pi) = 2, M (p A ) = 3, A/(p 6 ) = 3, M{pr) = 2, M (p 8 ) = 3, M (p 12 ) = 3, A/(p 13 ) = 2. 
i.e., the customer load at cut A is K = M(q 2 i =p^) = 3. 

(l.c) In order to show the rapid convergence of RTA (Section 4.3.3), we arbitrarily 
select the service rate of r 2 as = 20.0. 
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Figure 6.1: Fork/ Join Queueing Network with Aggregated Net Systems Obtained 
from the Hierarchical Decomposition at Cut £b and £c ( Service Times: Si = 0.33, 
52 = 2, S 3 = 1, 5 4 = 4, 5 5 = 2, S 6 = 0.5, S 7 = l, S8 = l> 5 9 =1, 5i 0 = l ,5x1 = 1) 

(2) The results of the iterations are shown in Table 6.1: 

6. 2.1.1 Response Time Approximation 

In order to show the rapid convergence of the method, the iterative results 
are shown in Table 6.1. The initial value of /i® = 20.0 is far off the final value of 
0.59674. 

The error of the throughput introduced by the aggregation is -0.14%, while 
ASx has only 372 states and AS 7 has 360 states. By finding proper initial values 
for the iteration (Algorithm 1, Step 3) the maximum cycle time in SNi is 1.278, 
therefore = 0.7826. This way, we reduce the number of CTMCs which have to be 
solved for the same precision by one. 
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^ = 20.0; A™ 

, c t = 0.22866 

r 

■E2SI 

■ESI 



i 

0.23281 

0.23281 

0.22444 

0.60422 

Q 

0.22852 

0.23749 

0.22824 

0.59674 

Q 

0.22833 

0.23759 

stop 



Table 6.1: Results Using RTA for SISO Cut A at Fork/Join QN 
6. 2. 1.2 Flow Equivalent Aggregation 

In this example the firing rate of is the same for all possible customer 
populations. Here, the merged interface places form a selfloop with r 2 , i.e., all 
tokens are trapped. Table 6.2 shows the conditional throughputs as a f un ction of 
the number of tokens. 



AS! 

as 2 

conditional throughput 

0.3430406 

0.232809 

tangible state space 

93 

90 


Table 6.2: Results Using FEA for SISO Cut A Fork/Join QN 

By substituting the values in Table 6.2 into the basic skeleton, the throughput 
is computed to be equal to 0.20313, i.e., the error introduced by the FEA is —11.2%. 

6. 2. 1.3 Delay Equivalence 

Table 6.3 shows the results using state independent delay equivalence aggrega- 
tion (Section 5.3. The number of iterations required to reach convergence is higher 
than in RTA, while the accuracy is comparable (the introduced error is 0.18%). 

6.2.2 SIMO/MISO Cut B 

The resulting net systems are shown in Level 1 of the hierarchical decomposi- 
tion presented in Figure 6.1 (.4<S{ and .AiS 2 ). A solution for LPP5 (Rule 2 in Section 
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Pi = 1.0; 

wc 

II 

1—* 

0 

^exact “ 0.22866 

r 

K 

p\ 

x; 


T" 

0.342741 

0.167135 

0.146091 

0.189616 

2 

0.186681 

1.310073 

0.232623 

0.226793 

" 3 " 

0.219454 

0.561616 

0.227288 

0.247681 

4 

0.236306 

0.624189 

0.228989 

0.236558 

5 

0.227493 

0.638746 

0.229293 

0.238451 

6 

0.229019 

0.627703 

0.229065 

0.238784 

7 

0.229287 

0.630124 

0.229117 

0.238509 

8 

0.229066 

0.630256 

stop 



Table 6.3: Results Using Delay Equivalence for SISO Cut A at Fork/ Join QN 
4.4) is: 

M{px) = 2, M(p $ ) = 3, Mfa) - 2, M(p 6 ) = 3, M{pi 2 ) = 3, A/(pi 3 ) = 2. 
i.e., the customer load at the cut is: K = 2. 


6.2. 2.1 Response Time Approximation 

For this cut, the error introduced in the throughput is 0.18%. A 4 SJ has only 48 
states, while AS] has 3240 states. The large difference in the size of the respective 
state space is due to the fact that the cut is not balanced in the sense that it 
does not produce aggregated net systems with roughly the same state space (which 
is obviously bad from a computational point of view). Nevertheless, it has been 
considered for illustrative purposes. Table 6.4 shows the iteration results. 


= 20.0 n 

F 

■i 1 

■ESI 


Pi 

a 

0.23125 

0.23122 

0.68693 

0.21317 

0 

0.25172 

0.22919" 

0.67390 

0.22851 

□ 

0.25248 

0.22909 

0.67343 

0.22906 

a 

0.25251 

0.22908 

stop 



Table 6.4: Results Using RTA for SIMO Cut B for Fork/ Join QN 


















90 


6. 2. 2. 2 Flow Equivalent Aggregation 

Similar to the previous example, the conditional throughput of *4<Sj and ASj 
(i.e., the firing rate for t x and r 2 ) is the same for all possible customer populations 
as shown in Table 6.5. The throughput obtained using FEA is equal to 0.19712, i.e., 
the error introduced by the aggregation is —13.79%. 



ASi. 

AS-2 

conditional throughput 

(n = 1) 

0.43619 

0.22941 

tangible state space 

(n = l) 

15 

1080 

conditional throughput 

(n = 2) 

0.46801 

0.23123 

tangible state space 

( n — 2) 

27 

1440 


Table 6.5: Results Using FEA for SISO Cut B in Fork/ Join QN 


6.2.2. 3 Delay Equivalence 

Table 6.6 shows the results using state independent delay equivalence aggre- 
gation (Section 5.4) at cut B. The error introduced by the aggregation is 0.05%. 



O 

i-H 

II 

r— 1 
H 

Ml) = 1-0 

D 

MK39 


x; 

^2 

D 


0.347934 

0.215488 

0.254321 

Q 

0.230400 

0.740182 

0.229558 

0.241750 

U 

0.221138 

0.632954 

0.228680 

0.254576 

LL 

0.230585 

0.639804 

0.228756 

0.251808 

o 

0.228572 

0.641122 

0.228770 

0.252058 

a 

0.228755 

0.640600 

0.228765 

0.252084 


Table 6.6: Results Using Delay Equivalence for SISO Cut B at Fork/ Join QN 
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6.2.2A Marie’s Method 

Table 6.7 shows the results obtained using Marie’s method for both cuts. The 
error introduced by the aggregation is — 0.09% for cut A and 0.23% for cut B. 


All initial values 1.0 

r 

A (cut A) 

A(cut B) 

1 - 

0.193942 

0.201954 

2 

0.236053 

0.245348 

3 

0.227551 

0.227600 

4 

0.228805 

0.229971 

5 

0.228446 

0.229053 

6 


0.229177 


Table 6.7: Results Using Marie’s Method for SISO Cut B at Fork/ Join QN (through- 
put shown for the basic skeleton system) 


6.2.3 Hierarchical Decomposition using RTA 

Figure 6.1 also shows the hierarchical decomposition. The marking of the 
aggregated nets are obtained from Algorithm 2 (Section 4.6). Cut B partitions the 
original net system into the aggregated net systems and AS\ at Level 1. On 
the next level, only is partitioned by C to form -AS^i and AS 22 - %2 is computed 
at Level 2 using AS ^ and AS\ 2 - The initial rate for was chosen as 20.0. The 
iteration results are shown in Table 6.8. 


| = 20.0 

r 

AT 1 

■ESI 


Mi 

T 

0.46735 

0.46760 

0.22376 

0.27148 

Q 

0.24255 

0.66209 

0.23042 

0.25512 

U 

0.23097 

0.67181 

0.23058 

0.25459 

B 

0.23059 

0.67199 

stop 



Table 6.8: Results Using Hierarchical Decomposition at Fork/ Join QN 
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The original net system has 33480 states, while *45} has 48 states, AS] t has 
144 states, and *45 22 has 90 states. The error introduced during the aggregation 
is 0.84%. During the iterative process, *45} was analyzed a total of 4 times while 
*45 21 an d -45} 2 were analyzed 7 times. 


6.2.4 Comparison 

Table 6.9 shows a summary of all results for the throughput of the Fork/Join 
QN, as well as the largest state space of an aggregated subnet encountered during 
the approximation. 


original net: X = 0.22866, State Space = 33480 

Cut A 

Method 

Throughput 

Error 

State Space 

Iterations 

RTA 

0.22833 

-0.14% 

372 

3 

FEA 

0.20313 

-11.16% 

93 

— 

Marie’s Method 

0.22845 

-0.09% 

372 

5 

Delay Equivalence 

0.22907 


372 

8 

Cut B | 

RTA 

0.22908 

0.18% 

3240 

H 4 

FEA 

0.20313 

-13.79% 

1440 

— 

Marie’s Method 

0.22918 

-0.23% 

3240 

6 

Delay Equivalence 

0.22907 

0.05% 

3240 

6 

Hierarchical RTA 

0.23059 

0.84% 

144 

4 


Table 6.9: Comparison of the Approximation Results for Fork/Join QN 


With the exception of FEA, all methods show very good accuracy at a signif- 
icant computational saving. RTA shows the fastest convergence. 
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6.3 Four Task Pipeline 

We now analyze the MG in Figure 4.7 whose structure (but not the marking) 
represents a four task pipeline found in concurrent software environments. The 
example is inspired by the example in Y. Li’s paper [LW91]. It is analyzed with two 
different initial markings: 

M$ ] : This is the marking shown in Figure 4.7. The exact value of the throughput 
for M i l) is equal to 0.513195, computed with the underlying CTMC which has 
4500 states (using the tool SPNP [DBCT85]). Note that P 2 o> P 21 and P 22 are 
implicit places for the initial marking shown. 

M™ : Now M(p 2 i) = l, M(p 4 ) = l and M(p 19 ) = 2, while all other markings remain 
the same. None of the places in the net is now implicit. The exact throughput 
of this MG system is 0.490362, while the underlying CTMC has 2916 states. 

- The service times are as follows: 

3l = 0.20, s 2 = 0.20, s 3 = 0.10, s 4 = 0.10, 35 = 1-00, 3 6 = 0.10, 37 = 0.20, 3 8 = 1.20, 

39 = 0 . 10 , 3io = 0.09, 311 = 0.90, 312 = 0 . 08 , 313 = 0.07, 314 = 0.15, 315 = 0.10, 316=1.00. 

6.3.1 Single SISO Cut B 

We define the SISO cut B through p 7 and p 17 as shown on top of the hierarchy 
in Figure 4.7. The aggregated net systems are shown in the first level of the decom- 
position (>151 and AS\). The net in Figure 4.7 is already shown with a solution 
for LPP3, i.e., the number of tokens in 921 (921 = Pit , Figure 4. 1 ) has already been 
maximized. The customer load at cut B is two. 

6.3.2 SISO Cut B, Response Time Approximation 

In order to examine the speed of convergence of RTA as a function of the 
initial values of the iteration, we study the system for different values of r 2 in 


eSKfcWJL PSC€ ta 

Or POOH QUALITY 
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Initial Marking 

Initial Marking 


Iteration Resu 

Its for A = 0.1 

r 

xr l 

■ESI 


A 

msM 

IHSI 


A 

D 

.098578 

.784181 

.557147 

0.89741 

.098764 

.845246 

.512319 

.707871 

B 

.481131 

.615319 

.503148 

1.03136 

.471021 

.705199 

.485372 

.753004 

El 

.498699 

.607624 

.499849 

1.03956 

.482587 

.697578 

.483479 

.755988 

_4_ 

.499609 

.607214 

stop 

MSWi 

.483300 

.697094 

.483357 

.756180 


Iteration Resu 

Its for = 

B 

.495060 

.609250 

.500553 

1.03780 | 

.525254 

.664642 

.474687 

.769652 

B 

.499416 

.607301 


KK 

EB 

.694906 

.482804 

.757049 

Q 

.499647 

.607197 

stop 

■■1 

.483553 

.696922 

.483314 

.756248 





■H 

.483362 

.697052 

stop 


Iteration Results for /z2=10.0 


a 

.570988 

.572760 

.483818 

1.07983 

jEfflFETil 

.579211 

.446575 

.811949 

El! 

.503844 

.605292 

.498832 

1.04209 

.495621 

.688439 

.481142 

.759651 

ai 

.499886 

.607089 

.499616 

1.04013 

.484168 

.696502 

.483208 

.756414 

ai 

.499673 

.607186 



.483402 

.697025 

.483340 

.756207 


Table 6.10: Results Using RTA for SISO cut B with Different Initial Values of r 2 
at Pipeline System 

Table 6.10 shows the iterative results for the two initial markings. The error 
of the throughput introduced by the aggregation for is —2.71%, while AS} has 
75 states and AS} l 135 180 states. For the error of the throughput introduced 
by the aggregation is -1.45%, while AS} has 81 states and AS} has 108 states. 

From the results of the different initial values in Table 6.10 we see that initial 
values “close” to the final ones can reduce the number of iterations by one. 

6.3.3 SISO Cut B, Flow Equivalent Aggregation 

Table 6.11 shows the conditional throughputs obtained with all possible cus- 
tomer populations. For the throughput obtained using FEA is equal to 0.38897, 
therefore the error introduced by the aggregation is —24.2%. For M^\ the approx- 
imated throughput is 0.37797 with an error of —22.91%. 
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Initial Marking Mq 1) 

AS\ 

AS\ 

conditional throughput 

(n = 1) 

0.50267 

0.62190 

tangible state space 

(n = 1) 

20 

60 

conditional throughput 

(n = 2)1 

0.57433 

0.62190 

tangible state space 

(n = 2) 

50 

60 

Initial Marking A/q 2) 

J I5j 

as 2 

conditional throughput 

(n = 1) 

0.57742 

0.55699 

tangible state space 

(n = 1) 

36 

36 

conditional throughput 

(n = 2) 

0.57742 

0.55699 

tangible state space 

(n = 2) 

36 

36 


Table 6.11: Results Using Flow Equivalent Aggregation for SISO Cut B at Pipeline 
System 

6.3.4 SISO Cut B, Marie’s Method 

Table 6.12 shows the results obtained by using Marie’s method for both initial 
markings. All initial values were taken to be one. 


All initial values 1.0 

r 

X(M^) 


T 

0.455636 

0.428801 

2 

0.519786 

0.508209 

3 

0.499322 

0.479429 

4 

0.505794 

0.490432 

5 

0.503682 

0.486397 

6 

0.504346 

0.487948 

7 

0.504129 

0.487379 


Table 6.12: Results Using Marie’s Method for SISO Cut B at Pipeline System 

(Throughputs shown for the Basic Skeleton System) 


The error introduced by the aggregation is —1.77% for and —0.61% for 
A/q 2) . Convergence did not present a problem. 
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6.3.5 SISO Cut B, Delay Equivalence 

In Table 6.13 the iteration Results Using state dependent delay equivalence 
with the initial marking Mq are presented. The values of m are now a function 
of the number of customers n in its input place. In a previous step, the firing rates 
with a costumer load equal to one were determined to be: /i x (l) = 1.00010 and 
^ 2 ( 1 ) = 0.60499. These values were also used as the initial values for p°(2) and p 2 (2) 
respectively. The second (fourth) column shows the throughput obtained with ASi 
(AS 2 ) at the current iteration. In the third (fifth) is the new firing rate of p 2 (2) 
(/Xx(2)) which is going to be used in the next iteration in .45 x (^4cS 2 ). 



A*i(l) = 

1.00010 

p 2 {l) = 0.60499 

r 

x{ 




1 

0.495072 

1.214634 

0.498700 

0.623275 

H 

0.506690 

1.122975 

0.505133 

0.617402 

Q 

0.502279 

1.100289 

0.503109 

0.611861 

B 


1.129935 


EEE20I 

K3 

0.502640 

usmzm 

ESBEI 

0.615856 

Q 

imi 

WS5SMHM 

0.502569 

0.615088 

B 

0.501956 

1.127689 



H 

0.502524 

■mama 


0.615761 

ill 

0.502407 


BilMiMiMil 

0.615194 


Table 6.13: Results Using State Dependent Delay Equivalence for SISO Cut B and 
Mq 1] at Pipeline System 

The error introduced by the approximation is —2.1%. Note that the initial 
values of pi(2) and p 2 (2) are already very close to their final values, still it takes 9 
steps to achieve convergence. If initial values for p 2 {2) = Pi(2) = 1.0 are used, the 
method takes 29 steps to converge. For the same net structure, but for different 
initial markings, we were not able to find initial values for which state dependent 
delay equivalence converges. This method is not robust, i.e., convergence is depends 
very much on the initial marking and is often difficult to achieve. 
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Table 6.14 shows the results for state independent delay equivalence. For 
the introduced error is —2.44% (—2.49% for M q 2 ^). 



= 1.0 

nl = 1.0 


Initial Marking 

r 



x; 

K(2) 

1 

0.495060 

0.530136 

0.461693 

1.243986 

2 

0.517779 

0.609419 

0.500626 

1.016688 

3 

0.497029 

0.613985 

0.502582 

1.048344 

4 

0.500566 

0.608701 

0.500315 

1.051924 

5 

0.500950 

0.609637 

0.500720 

1.049439 

6 

0.500684 

0.609703 

stop 


Initial Marking Mq 2) 

r 


(4(2) 

x; 

Pi(2) 

T 

0.525254 

0.543843 

0.432350 

0.801388 

2 

0.493443 

0.699546 

0.483973 

0.705709 

3 

0.470429 

0.680914 

0.479159 

0.740314 

4 

0.479483 

0.675243 

0.477630 

0.736225 

5 

0.478459 

0.678178 

0.478425 

0.735028 

6 

0.478157 

0.677693 

0.478294 

0.735662 


Table 6.14: Results Using State Independent Delay Equivalence for SISO Cut B 
and at Pipeline System 


6.3.6 Hierarchical Decomposition 

Figure 4.7 shows a hierarchical decomposition by the cuts A, B and C. The 
marking of the aggregated nets are obtained from the procedure outlined in Algo- 
rithm 2 in Section 4.6 (with the shown marking, the number of tokens in all interface 
places is already maximized). Cut B partitions the original net system into the ag- 
gregated net systems A«Sj and A<S 2 at Level 1. On the next level, A <5} is partitioned 
by A to form A5 i 2 and A<Sj 2 , while A«S 2 is partitioned by C to form A5 21 and 
A«S 22 . X\ (X 2 ) is computed at Level 2 using AS^ and AS\ 2 (A«S 21 and AS 22 ). The 
iteration results are shown in Table 6.15, while a comparison of the state space and 




98 


the number of iterations is shown in Table 6.16. All initial rates were chosen to be 

10 . 0 . 


Initial Marking 

r 

xr^ 

/X 1 

x; 


T 

0.548352 

0.5499239 

0.466712 

1.056116 

2 

0.477591 

0.5672174 

0.474982 

1.034340 

3 

0.475409 

0.5679352 

0.475307 

1.033512 

Initial Marking 

r 

xr i 



^2 

X 

0.568095 

0.5698403 

0.441979 

0.8121824 

2 

0.486450 

0.666298 

0.473333 

0.7622874 

3 

0.476178 

0.6735495 

0.475281 

0.7590964 

4 

0.475474 

0.6740473 

0.475411 

0.7588733 


Table 6.15: Results Using Hierarchical Decomposition for Pipeline System 


Tangible State Space 

| marking 


E&a 

Esm 

E§a 


5 

30 

45 

12 

mum 

9 

27 

27 

12 

Number of Evaluations 

marking 


E&9 

E&a 

E^I 

mmm 

6 

4 

7 

4 

msm 

10 

10 

10 

7 


Table 6.16: Tangible State Space for Hierarchical Decomposition and Number of 
Evaluations 

From Table 6.16 we see that the state space is reduced by more than two orders 
of magnitude: For the original marking, the error introduced during the aggregation 
is —7.3% (—3.13% for the modified initial marking). 


u 
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6.3.7 Comparison 

Table 6.17 shows a summary of the results. The iterative aggregation methods 
introduce a very small error into the final result, and at very reasonable computa- 
tional efforts. The only method that poses major convergence problems is state 
dependent delay equivalence, which often fails to converge. RTA converges particu- 
larly fast, even when using “bad” initial values for the iteration. FEA introduces an 
unacceptably large error in the aggregation. This is due to the strong dependence 
of the mean completion time on the interarrival process. 


Initial Marking Mq 1) 

Method 

Throughput 

Error 

State Space 

Iterations 

original net 

0.51319 


4500 

— 

RTA 

0.49967 

■SHE! 

180 

3 

FEA 

0.38897 

EeMEJIKl 

60 

— 

Marie’s Method 

0.50413 


180 

7 

Delay Equivalence 
(state dependent) 


mm 

180 

9 

Delay Equivalence 
(state independent) 

0.50068 


180 

6 

Initial Marking 

original net 

0.49036 

— 

2916 

— 

RTA 

0.48334 


108 

4 

FEA 

0.37797 

-24.20% 

36 

— 

Marie’s Method 

0.48738 

-0.61% 

108 

7 

Delay Equivalence 
(state independent) 

0.478294 

-2.49% 

108 

6 


Table 6.17: Comparison of the Approximation Results for Pipeline System 


6.4 Kanban System 

The following is an application taken from the field of manufacturing systems. 
Figure 6.2 shows a 6 stage Kanban system. 
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The just-in-time philosophy for the control of manufacturing systems consists 
of producing just the needed parts at each production stage in just the right time. 
Kanban control is a way to implement a just-in-time manufacturing system. A 
Kanban is a ticket that accompanies a part through several stages of the production 
system. When a part of a given stage is consumed by the succeeding stage, the ticket 
is sent back to trigger the production of a new part. Petri nets have been shown 
to be well adapted to provide a unified modeling of decision-free Kanban systems 
[MFDD89] and most models can easily be represented by MGs. 

In the MG system of Figure 6.2, places fi (* = 1 ... 6) represent the available 
space in the work area (number of Kanban tickets), p,- the parts awaiting service, 
Wi the parts which have finished processing at transitions <p,- and c, the available 
number of machines at each stage. In this MG system, we assume that a transition 
t enabled K times in a marking M (i.e., K = max{& | M > k Pre[t]}) works 
at conditional speed K times it would work in the case it was enabled only once 
( injinite server semantics) [RP84, Zub85, HV85]. 

Here c, limits the enabling degree of twi, i.e., tu;, is enabled min{A/(p,), M(c,)} 
times. For the sake of simplicity we assume that all machines have exponentially 
distributed processing times with unity mean. The exact throughput is 1.496715, 
computed with the underlying CTMC which has a total of 93843 states (18040 
vanishing and 75803 tangible markings using the tool GreatSPN [Chi85]). 
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For single level decomposition, the net is split into two subnets by a single SISO 
cut defined through the interface places p 4 and / 4 . As before, AS i is defined to 
the left of the cut while AS 2 is defined to the right of £ A . By solving LPP3 (Section 
4.3.2, we find that a possible solution for the marking of the interface places is: 
A/(/ 4 ) = 4; M{pa) = 0, i.e., the customer population is K = M {q 2 \ = / 4 ) = 4. 

6.4.1 Response Time Approximation 

Table 6.18 shows the results using RTA. v45i has a total of 1285 states (380 
vanishing and 905 tangible markings) while AS 2 has a total of 335 states (210 
vanishing and 125 tangible markings). The introduced error is -1.47%. 

The reduction is by a factor of 73 (=93843/1285) when the total state space 
is considered and by a factor of 47 (=18040/380) if only the tangible state space is 
considered. 


nl = io.o 

r 


U r ~ L 

f*2 



~r 

1.573246 

1.574060 

1.308050 

1.706777 

2 

1.480033 

2.058081 

1.471149 

1.690805 

3 

1.474889 

2.072899 

1.474685 

1.690446 


Table 6.18: Results Using RTA for Single Cut at Kanban System 


6.4.2 Flow Equivalent Aggregation 

Similar to the previous example, the throughput of is the same for all 
possible customer populations. Here, the merged interface places form a selfloop 
with r 2 , i.e., all tokens are trapped. Table 6.19 shows the conditional throughputs 
as a function of the number of tokens. 

The throughput obtained using FEA is equal to 1.26180, i.e., the error intro- 
duced by the aggregation is —15.70%. 
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Conditional throughput ASi = 1.573250 
Number of tangible States = 50, n= 1, 2, 3, 4 

n 

conditional throughput AS 2 

State Space AS 2 

i 

0.956173 

34 

2 

1.587574 

42 

3 

1.652232 

50 

4 

1.697872 

58 


Table 6.19: Results Using Flow Equivalent Aggregation at Kanban System 
6.4.3 Marie’s Method 

Table 6.20 shows the results obtained by using Marie’s method (Section 5.3). 
All initial values were taken to be one. 


All initial values 1.0 

r 

* 

1 

1.839026 

2 

1.408609 

3 

1.523192 

4 

1.484177 

5 

1.495074 

T 

1.491380 

7 

1.492412 

8 

1.492063 


Table 6.20: Results Using Marie’s Method at Kanban System (Throughput Shown 
for the Basic Skeleton System) 


The error introduced by the aggregation is —0.31%. Convergence did not 
present a problem. 


6.4.4 Delay Equivalence 

Table 6.21 show the results using state independent delay equivalence aggre- 
gation (Section 5.4). For this example, convergence depends on the initial values of 


a 
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the iteration, e.g., using n\ = /x° = 1.0 delay equivalence does not converge. The 
number of iterations required is significantly higher than in RTA, while the accuracy 
is comparable (the introduced error is —0.34%). 

In RTA and delay equivalence, the structure and marking of the aggregated net 
systems are the same, therefore the state space is also identical. Delay equivalence 
uses the conditional probabilities of customers in both aggregated net systems at the 
current iteration. We believe that this makes delay equivalence less robust, as an 
unfortunate combination of initial values can cause unstable oscillations. 


Pi = 1.0 n't = 4.0 

r 



Xi 

02 

i 

0.995363 

28.328484 

1.694930 

1.685173 

2 

1.473015 

1.223924 

1.113728 

2.283125 

3 

1.560674 

1.776035 

1.388686 

1.461586 

4 

1.370634 

4.147622 

1.650823 

1.673865 

5 

1.469159 

1.718533 

1.367792 

1.979070 

6 

1.537082 

1.854180 

1.414690 

1.646106 

B 

1.459144 

2.740666 

1.579719 

1.689514 

8 

1.474462 

2.038670 

1.466410 

1.830199 

9 

1.512249 

1.973474 

1.449564 

1.727731 

10 

1.486418 

2.313612 

1.523348 

1.713405 

11 

1.482097 

2.157780 

1.493647 

1.769801 

12 

1.498054 

2.067290 

1.473355 

1.748789 

13 

1.492434 

2.183312 

1.498943 

1.731198 

14 

1.487435 

2.171140 

1.496441 

1.750136 

15 

1.492805 

2.118218 

1.485076 

1.749755 

16 

1.492700 

2.149813 

1.491957 

1.740347 

17 

1.490069 

2.160998 

1.494324 

1.745181 

18 

1.491431 

2.139305 

1.489700 

1.747555 

19 

1.492092 

2.144376 

1.490793 

1.743924 

20 

1.491079 

2.152818 

1.492596 

1.744530 

21 

1.491248 

2.146035 

1.491149 

1.746062 

22 

1.491677 

2.145147 

stop 



Table 6.21: Results Using Delay Equivalence at Kanban System 
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6.4.5 Hierarchical Decomposition using RTA 

Figure 6.2 also shows the cuts leading to a hierarchical decomposition. The 
markings of the aggregated nets are obtained from Algorithm 2 (Section 4.6). Cut B 
partitions the original net system into the aggregated net systems .A5} and .45} at 
Level 1. On the next level, only .45} is partitioned by C to form .45^ and -45} 2 . 
X\ is computed at Level 2 using 45} x and .A5} 2 . The initial rate for n 2 was chosen 
as 20.0. The iteration results are shown in Table 6.22. 


n b 2 = 20.0 

r 

xr i 


x; 

A 

1 

1.49408 

1.49467 

1.42838 

2.69747 

2 

1.45576 

1.52824 

1.45317 

2.66603 

3 

1.45433 

1.52968 

1.45422 

2.66471 


Table 6.22: Results Using Hierarchical Decomposition at Kanban System 

Here, the state space is reduced by more than two orders of magnitude: AS n 
has a 22 states, AS has 230 states while AS 2 has 58 tangible states. The error 
introduced during the aggregation is —2.84%. The aggregated net systems have to 
be analyzed repetitively, e.g., during the iterative process, AS\ was analyzed a total 
of 3 times while 45^ and AS\ 2 were analyzed 6 times. 

6.4.6 Comparison 

Table 6.23 shows a summary of the results. 

With the exception of FEA, all methods show good accuracy. RTA provides 
a significant computational saving by showing the fastest convergence. In this ex- 
ample, the initial values for delay equivalence have to be chosen carefully in order 
to achieve convergence. The number of required iterations for this method is high 
when compared to RTA. 
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| original net: X = 1.496715, Tang. State Space = 75803 

Method 

Throughput 

Error 

State Space 

Iterations 

RTA 

1.47469 

-1.47% 

380 

3 

FEA 

1.26180 

-15.70% 

58 

— 

Marie’s Method 

1.49206 

-0.31% 

380 

7 

Delay Equivalence 

1.491677 

-0.34% 

380 

22 

Hierarchical RTA 

1.45422 

-2.84% 

230 

3 


Table 6.23: Comparison of the Approximation Results for Kanban System 
6.5 Conclusions 

In this chapter we have presented numerical results for the approximation 
methods which were introduced in the previous chapter. For the particular examples, 
we can draw the following conclusions: 

• FEA has the lowest state space of the CTMC and is therefore computationally 
very efficient. The drawback is obvious: in the examples, there exists a strong 
dependence of the service time on the arrival process, therefore as outlined in 
Section 5.4.3. the accuracy can suffer. 

• Delay equivalence shows good accuracy, provided it converges. The choice of 
proper initial values is critical for convergence. This is particularly true for 
state dependent delay equivalence, which failed to converge in almost all cases. 
This method uses the token distribution of both aggregated net systems at the 
current iteration. In the initial step, these distributions have to be determined 
with the initial values of the iteration. A poor choice will lead to instabilities 
(and eventually non-convergence) in the iteration. 

• RTA shows similar accuracy as delay equivalence, but at a greatly reduced 
computational cost. RTA is insensitive with respect to the initial values and is 
the one which requires the least amount of iterations. RTA does not make use 
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of the token distribution in the subnets, and only uses the throughput. We 
believe, that this accounts for the good robustness of the method. Hierarchical 
decomposition can possibly lead to a larger error. 

• Marie’s method shows the best accuracy for all SISO cuts, with a reasonable 
computational cost (about twice as many iterations as RTA). Note that Marie’s 
method uses all information about the token distribution in the aggregated 
net systems, i.e., it uses more information than RTA. On the other hand, 
convergence is slower. 

The savings in the state space immediately leads to a great saving in the time 
required to obtain the solution. For the Kanban example, the time needed on a SUN 
4/330 SPARC with 16 Mb RAM to compute the exact solution of the original system 
was around 2 h. 30 m while the approximate solution computed with 3 iterations 
was obtained in less than 1 minute using SPNP [DBCT85]. 

At this point it is important to state that a Kanban system with 8 stages 
generates such a large state space that the computation of the exact throughput is 
impossible because of the lack of memory. Its decomposition into two aggregated 
nets of 4 stages plus the environment transitions (tx and r 2 ) leads to a computation 
in less than two minutes! 


CHAPTER 7 

Macroplace/Macrotransition Nets 


7.1 Introduction 

In the previous chapters we have introduced approximation methods for 
stochastic MGs. The modeling power of this net subclass is restricted, because 
decisions are forbidden. In this chapter, we introduce a new class of Petri nets, 
Macroplace-Macrotransition-nets (MPMT-nets), a subclass which allows limited 
choice, concurrency and sharing of resources [DJS92]. MPMT-nets have a greatly 
increased modeling power over SMs and MGs by iteratively combining Macrotran- 
sitions (where a single transition has been expanded to a MG) and Macroplaces 
(where a place has been refined to a SMs). By exploring the qualitative properties of 
MPMT-nets, we show that the previously introduced approximation technique for 
SISO cuts are applicable to this subclass of PNs. 

7.2 MPMT-nets: Motivation and Definition 

Even if some practical systems can be modeled by Fork/ Join queueing net- 
works with blocking or closed queueing networks (thus essentially strongly con- 
nected marked graphs state machines, respectively), the interleaving of choices and 
synchronization is fundamental in many cases. For example, the structure of flow 
line models with unreliable machines are neither MGs, nor SMs (e.g., see Figure 
8 . 2 ). 

A traditional way of interleaving choices and synchronization in a controlled 
way is given by Free Choice nets. The behavioral and structural analysis of FC nets 
is particularly elegant and well understood (see [ES90a] for a survey). Nevertheless, 
classical schemes in manufacturing in which two or more processes share a common 
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resource (e.g., a robot) cannot be modeled with FC net systems. 

Here we introduce a different combination of the basic structures of SMs and 
MGs. The practical modeling with this new subclass of net systems has some 
advantages in the field of manufacturing with respect to the use of the well known 
subclass of live and bounded free choice net systems allowing e.g., the limited sharing 
of resources and some synchronization at choice places. 

The new net subclass is called MPMT-nets, because they can be obtained by 
using the macroplace. (MP) and macrotransition (MT) reduction/refinement rules. 
The MP-rule is strongly related to the concept of SMs [Sil81], while the MT-rule is 
its reverse-dual and therefore, strongly related to MGs (see [ES90a] for an integrated 
consideration of both rules). 

The MP (MT) rule consists of the substitution of a given SM (MG) subnet 
by a single place (transition). Only a particular case of MP (MT) reduction rule 
is considered here: a unique way-in place (transition) and a unique way-out place 
(transition) exists. 

Way-in places are those that can be used to “enter” into the subnet, and 
way-out places are those through which we can “leave” it. More formally: 

Let = (P\ r, F') be a subnet of M = (P, T, F ) 

[i.e., F' = FC\ ((P' x D U {T x P'))}: 

(a) p' € P' is a way-in place of j\P iff *p' fl (T \ T') ^ 0 

(b) p' € P' is a way-out place of jV iff p * fl (T \ T') ^ 0 

where (T \ T') means the set T minus the set T' . 

Way-in and way-out transitions are defined analogously. 

Definition 7.2.1 Let jV' be a subnet of tf . j\f'= ( P\T F ') is reducible to a place 

if: 


a 
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Figure 7.1: Macroplace Reduction Rule 



Figure 7.2: Macrotransition Reduction Rule: Disregarding the Marking, the Subnet 
is the Reverse-dual of that in Figure 7.1 

(a) N' is a state machine containing one way-in place, p'-, and one way-out place, 
Po (p\ = Po is possible). 

(b) For every p / € P‘ , there exists at least an F'-path from: 

(b.l) the way-in place p' { to p’ . 

(b.2) p 1 to the way-out place, p' Q . 

The next definition expresses the standard notion of substitution of a net by 
a place, x. See Figure 7.1 for an example. 
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Definition 7.2.2 Let (Af,M 0 ) be a system and Af = ( P',T F,) a subnet of Af = 
(P,T;F) -reducible to a place. The net Af r = (P r ,T r ; F r ), with: 

• Pr = (P \ P') U M 

• T r = (T\ T) 

• F r = (F n ((F r x T r ) U (T r x P r ))) u F ir, where: 

- (f,ir) € F* iff ( t,p'i ) € F 

- (?r,t) € F, iff (ff 0 ,<)€F 

is a macroplace reduction of Af, and : r is fAe macroplace that replaces Af' . The 
system ( Af T ,M T ) where Af r is the reduction of N and M r is given by: 

• M r (p ) = M 0 (p) if p^v 

• M r (ir) = -Wo(p) 

is called a macroplace reduction of { Af,Mo ) . 

The value of the macroplace concept lies in the following result [Sil81]: The 
reduction of a macroplace preserves liveness and the bound of the places of the sys- 
tem. 

Moreover, in our particular case (single way-in/single way-out place), re- 
versibility is also preserved. Therefore: 

Property 7.2.1 The reduction of a macroplace preserves liveness, the bound of 
places ( thus boundedness) and reversibility. 

At the structural level, the reverse-dual (i.e., reversing the arcs, changing 
places to transitions and vice versa) of the MP-rule is the MT-rule. 

Definition 7.2.3 Let Af be a subnet of Af. Af = ( P\ T F') is reducible to a 
transition if: 


ft 


Ill 


(a) -A/”' is a marked, graph containing one way-in transition, t\ and one way-out 
transition, t' 0 (t' = t’ Q is possible). 

(b) For every t' € T* there exists at least an F'-path from : 

(b.l) the way-in transition, t! v to t'. 

(b.2) t‘ to the way-out transition, 1f Q . 

Definition 7.2.4 Let ( N,M 0 ) be a system and Af = ( P',T'\F ') a subnet of Af = 
(P, T; F) reducible to a transition. The net j\f r = (P r , T r ; F r ) with: 

• P r = (P\P') 

• T r = (T \ T) U { 6 } 

• F r = (P n (( P T X T r ) U (T r x P r ))) U Fg, where: 

- (p,8)6Fg iff(p,tQ € F 

- ( e,p ) e F e ijf(? 0 ,p) e F 

is a macrotransition reduction of Af and 9 is the macrotransition that replaces Af' . 
The system (Af r , M r ) is called a macrotransition reduction of ( Af,Mo ) where: 

Mr = M 0 \ P\P' + K ■ Post(t' 0 ) 

and K is the minimum number of tokens found in the different paths from t\ to t' Q . 
In other words, M r is the restriction of M 0 to the remaining places plus K tokens 
per output place of t' Q . 

Figure 7.2 shows a macrotransition reduction. There exists one token in place 
c, thus the unique path from transition 3 to transition 1 has one token and K = 1. 

The value of the macrotransition concept lies in the following result that ba- 
sically derives from MG systems theory: 
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(a) A shared resource is used by two different processes (one with internal 

concurrency) 



(c) An MP-reduction of the System in (b) 

Figure 7.3: The Original Net is not Free-Choice, but MPMT (and simple) 

Property 7.2.2 In the absence of an unmarked circuit (or p-semiflow ) in Jf', the 
reduction of a macrotransition preserves liveness, boundedness (but not the bound 
of the system) and reversibility. 

Now the MPMT -nets can be recursively defined as follows: 

Definition 7.2.1 A macroplace/macrotransition (MPMT) net is an ordinary net 
such that, disregarding the conditions on the marking in the MP- and MT -rules, it 
can be reduced to a simple self-loop of one place and one transition by recursively 
applying the MP and MT reduction rule (Definition 7.2.4 an< ^ 7.2.2). 



Token initial*** Gat Part Description G.» ID 


Unload Part 
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(a) Petri net representation of a controller [AJD90b] 



(b) MP-reduction of the net system in (a) 

0 ? 

(c) MT-reduction of the net system in (b) 
Figure 7.4: An MPMT and FC net system 


Figures 7.3 and 7.4 show the reduction (and refinement) processes leading 
to two different MPMT-nets. The reduction (refinement) also verifies the marking 
conditions on the MP- and MT-rules, therefore, both net systems are bounded, live 
and reversible. 


Property 7.2.3 The. reverse and the dual (thus the reverse-dual also) of an MPMT- 
net is also an MPMT-net. 



Figure 7.5: A Reversible Non-live MPMT-system 
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The net in Figure 7.3 is MPMT but not FC. The net in Figure 7.4 is both 
MPMT and FC. 

How do we recognize if a given net is an MPMT-net? We need to find the 
subnets of a net which can then be reduced to a place or to a transition. For the 
reduction to a place, in [Sil81] an efficient (polynomial) algorithm for this purpose 
is given. It consists of first removing all transitions with more than one input or 
one output arc, which splits the net into one or more SMs. Then simple recursive 
procedures can be applied to each connected subnet to check conditions (b.l) and 
(b.2) from Definition 7.2.1. Now, for the reduction to a transition the same procedure 
is applied, but now first all places with more than one input or one output arc are 
removed. This splits the net into one or more MGs. Conditions (b.l) and (b.2) 
of Definition 7.2.3 can be checked by using simple procedures on each connected 
subnet. The entire process can be done in polynomial time. 

The net in Figure 7.1 (Figure 7.2) is analyzed by applying first the reduction 
of subnets to a transition (place). 

7.3 Basic qualitative properties of MPMT-nets 

From the definitions it is not difficult to observe that MPMT-nets are strongly 
connected. For SMs, strong connectedness guarantees the existence of live markings 
(i.e., structural liveness) and for MGs the boundedness for any initial marking (i.e., 
structural boundedness). On the other side, SMs are conservative nets (i.e., 3 Y > 0 
s.t. Y t • C = 0, thus, structurally bounded), while MGs are structurally live nets. 
Combining these facts, the following important structural properties hold. 

Property 7.3.1 MPMT-nets are: 

1. Ordinary and strongly connected. 

2. Conservative (thus, structurally bounded). 
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3. Structurally live (then also consistent, i.e., 3 > 0 s.t. C • X = 0, because 
they are also conservative). 

The basic structural properties of MPMT-nets have already been presented. 
Behavioral properties can be studied in a straightforward way. 

Property 7.3.2 Let Af be an MPMT-net. (N,Mo) is a live net system iff all p- 
semiflows are marked: 

VY > 0 s.t. Y t • C = 0 , Y t • Mo > 0 

Corollary 7.3.1 Liveness of MPMT net systems can be characterized in polynomial 
time: (Af, Mq) is live iff there exists no unmarked P-semiflow. Analytically, ( jV,M 0 ) 
is live iff there exists no solution for the following linear system: 

Y T • C = 0 

Y > 0 ( 7 . 1 ) 

Y T • Mo = 0 

Corollary 7.3.2 Let Af be an MPMT-net and Af r its reverse. (Af, Mo) is live iff 
( Af r , Mq) is live. 

Property 7.3.3 Let (Af,Mo) be a live MPMT-system. The three following state- 
ments are equivalent: 

i) M € R{Af, Mf), i.e., M is reachable from Mq. 

ii) M = M 0 + C -a, with M € !V n ,5 € Pl m . 

Hi) B t • M = B T • Mo, with B a basis of left annullers of the incidence matrix 
(i.e., B t -C = 0J and M € N n 

According to the above property, M € R(Af, Mq) iff Mq 6 R{Af, M). In other 


words: 
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Corollary 7.3.3 Live MPMT-systems are reversible, but the converse is not true, 
(see Figure 7.5). 

The bounds of places can be computed in polynomial time using Linear Pro- 
gramming Techniques [MJ88]: 

Corollary 7.3.4 Let (J\f,M) be a live MPMT-net system. The behavioral and 
structural bounds of place p coincide: 

B(p) = max{A/(p)|M <E R{Af,M 0 )} = 

SB{p ) — max{M(p)|iV/ = Mo + C • a > 0 , 5 > 0} (7-2) 

Property 7.3.3 relates directly to Theorem 3.4.3, while Corollary 7.3.4 is equivalent 
to Corollary 3.4.3 for MGs. 

7.4 Well-formed SISO cuts 

While in MGs, every SISO cut was a well-formed cut, in MPMT-nets we have 
to restrict the types of cuts which we can perform. Specifically we only cut the net 
at persistent places, i.e., places where |p*| = 1, i.e., once a token has entered an 
interface place, the routing out of it is deterministic. 

Definition 7.4.1 (Well-formed SISO MPMT cut) A cut £ is well-formed if 

l?X2 E = l?2ll = 1* 

With the above properties of structural liveness, reachability and the marking 
bound of places, we can apply the results of the previously introduced methods 
directly to MPMT-nets. Note that in general the vector of the visiting ratios can 
not be computed from the net structure. 

As a final comment, we can now give a particular case when to use FEA in 
MPMT-nets. From Property 2.3.1, we know that FEA leads to exact aggregation 
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in PFQNs. Therefore, if we aggregate a macroplace (i.e., a subnetwork which is in 
product form) by flow equivalence, we will be able to replicate at least the MCT 
of the subnet. Nevertheless as we will see, the change in the coefficient of variation 
might still introduce considerable error in the aggregation. 

7.5 Summary 

In this chapter we have introduced a new subclass of Petri nets with extended 
modeling capabilities over SMs and MGs. Having established the structural sim- 
ilarity of MGs and MPMT-nets for the marking bounds and the reachability of 
markings, we can use any of the previously discussed approximation methods for 
SISO cuts in MPMT-nets without any modification for the iterative part. This will 
be demonstrated in the next chapter. 



CHAPTER 8 

Examples for SISO Cuts in MPMT-nets 


8.1 Introduction 

In this chapter, we present some numerical examples for SISO-cuts in MPMT- 

nets. 

The first example considers the performance analysis of manufacturing transfer 
lines (or simply flow lines) which has received considerable attention in the literature 
due to their economic importance. In the first survey of this area [Koe59], the earliest 
papers date back 40 years, while today a vast amount of literature exists. In the 
most recent survey [DG91] 180 papers are cited which are directly related to their 
modeling and analysis. In the context of Petri nets, flow lines with reliable machines 
(FLRMs) can be modeled by marked graphs (MGs), while flow lines with unreliable 
machines (FLUMs) involve choice (i.e., the break down of machines) and therefore, 
can only be modeled using MPMT-nets. 

An analytic solution for the very particular case of a short two machine, one 
buffer flow line with exponentially distributed machining, failure and repair time has 
been found [GB81]. In [CG87] this analytic solution of a short line has been used to 
devise an efficient approximation for longer flow lines whose exact analysis by solving 
the underlying CTMC becomes computationally intractable due to the problem of 
state explosion. Another approximation method is described in [DDL89] which 
employs an approximation based on continuous material flow. These approximation 
methods are problem specific, i.e., only suitable for the analysis of flow lines. For 
this example, we compare Response Time Approximation (RTA), Flow Equivalent 
Aggregation (FEA), Delay Equivalence (DE) and Marie’s method. 

In the second example, we examine an MPMT-net, whose structure is that 
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Figure 8.1: Flow Line with N Machines 

of an uninterpreted dataflow graph [KBB87]. Here we use a hybrid approximation 
approach. In the first step a macroplace is reduced to a single transition by FEA. 
In the second step, the reduced net is analyzed by RTA and Marie’s method and 
the accuracy is compared. 

8.2 Modeling of Transfer Lines with Petri Nets 

Figure 8.1 shows a flow line with N machines. Parts enter through Mi and 
leave through We denote the z-th machine as A/,- and the buffer which lies in 
between A/,- and A/, + j a s B{. C{ is the capacity of i?,, i.e., the total number of parts 
which can be stored in between Mi and Af, + i . The current buffer content is denoted 
by c,- while "U denotes the vector of the buffer capacities. 

We now give a short review of flow line models and how they can be modeled 
using Petri nets. We concentrate on FLUMs with operation dependent failures, i.e., 
failures that can only occur when the machine is working. Time dependent failures, 
i.e., failures which do not depend on the machining operation and which can occur 
even if the machine is blocked or starved are not considered, but could also be 
incorporated into the Petri net model. Figure 8.2 introduces the notation for all of 
the transfer line models. 

A common assumption of these models is that the first machine is never starved 
and the last machine is never blocked. The machining operation of Mi is represented 
by transition m, with rate A,-, failure by /, with rate fii and the repair by r, with rate 
di. Note that the failure-repair model is a Macroplace. Pg t represents f? t . If P\[ x 
is marked, A/,- is currently idle (i.e., starved), whereas W t is marked if the machine 
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Figure 8.3: Blocking Before Service Parts not Occupied (BBS-PNO) Flow Line 

is currently processing a piece. is marked if the machine is down while 6/, is 
marked for a nonzero period of time if A/,- is blocked. Where no confusion arises, 
the names of places and transitions are omitted from the Petri net models. 

In [DG91], flow line models were divided into the following categories: 

• Blocking After Service (BAS)(Figure 8.2): 

if at the instance of completion of a part at Af, the part cannot be put into 
the downstream buffer B; because it is full, the machine is blocked. Mi will 
become operational only if a part is transported from B; to M 1+1 freeing one 
buffer space. 

• Blocking Before Service, Parts Not Occupied (BBS-PNO) (Figure 8.3): 
here the part is not admitted into Mi if no buffer space is available in B,-. 
The BBS-PNO line with buffer capacity ^+1 has the same throughput as 
a BAS line with capacity U (1 is a vector of appropriate size whose entries 
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are all one). This result was first established by Perros et al. [OP86] for the 
case of exponential timing, and has also been shown for the case of general 
distributions [DLT91]. Note that in BBS-PNO, if P\fi is marked, A/,- is starved 
of blocked. 

• Blocking Before Service, Parts Occupied (BBS-PO) (Figure 8.4): 

In this model, the part is admitted into the machine, but operation does not 
start until a space in the output buffer is available. 

Actually BAS is most likely to be encountered in manufacturing systems [DG91], 
e.g., in metal cutting operations there is no reason why a part should not be allowed 
into the system even if no output buffer space is immediately available. 

Flow lines with reliable machines can be constructed by simply deleting the 
failure circuit, i.e, /,, r,- and Pj. from all machines. 

8.2.1 Choong and Gershwin Approximation in Flow Lines 

The basic idea is to decompose an N machine transfer line into N — 1 modules 
[CG87] consisting of the buffer Bj , M u (j) (the upstream machine) and Md{j) (the 
downstream machine), which can analytically be solved by the method outlined in 
[GB81]. In Figure 8.5, the five machine flow line is decomposed into four modules. 
Each of the modules is characterized by six parameters, three for each machine: A„, 
fi u , 9 U and A<f, pd, &d- The goal of the approximation is to find these 6 parameters 
such that the pseudo-machines capture the upstream and downstream behavior of 
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Figure 8.5: Decomposition Method of Choong and Gershwin 

the flow line, i.e., the rates with which parts are processed if the line upstream is up 
(A£), the rate with which the upstream part of the line goes from the up to down 
state (p 3 u ) and vice versa A similar discussion holds for A^, p d and 6 d . 


The stand-alone efficiency of a machine is given by: 


Pi = 


A,fl, 

+ Pi 


( 8 . 1 ) 


In the line, the production rate will, in general, be smaller because there is a nonzero 
probability that the machine is blocked or starved (or both). The efficiency E of 
the machine in the line is therefore given by 


Ei = pi prob^-x > 0 and (c,- < C,- and Mi not blocked)] (8.2) 


The conservation of flow implies that 


E\ — E 2 = ... E s -x ~ Eltf (8-3) 

If the probability of the event “machine blocked and starved at the same time is 
small, equation (8.2) can be simplified to 

Ei — pi( 1 — prob[c,_i > 0] — prob[c,- < C t - and M, not blocked]) (8.4) 
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The next step is to examine A, p and 9 in the modules and how they relate to the 
machine parameters. From the examination of the interruption and resumption of 
flow at the modules and together with equations (8.3) and (8.4) a set of six equations 
per module can be derived, which then have to be solved iteratively. The method 
breaks down if the line is strongly unbalanced and p,- pj. 


8.2.2 Experimental Results: Three Machine Transfer Line 


The flow lines which are analyzed in this section were previously described 
and analyzed in [CG87]. The lines are of the type BBS-PNO, nevertheless, here the 
flow lines were modeled as BAS with the appropriate change in the buffer size as 
outlined in Section 8.2. 

As the convergence criterion, we chose that the throughputs obtained during 
the iteration do not change by more than 0.1%. 

In order to interpret the results, it is useful to look at the Moment Gener- 
ating Function of the time x it takes to move a token from place W; to 6/,. The 
moment generating function, T(s) can be derived by using the techniques outlined 
in [GDZ90]. First we transform the macroplace into a construct where the conflict 
is given by immediate transitions according to Theorem 3.5.1 and is presented in 
Figure 8.6. 

The transformed macroplace forms a loop structure. W^s) represent the loop, 
while W 2 (.s) represents the “gain” after the loop as shown in Figure 8.7. First we 
find the stochastic “transfer function” Wi(s) of the loop given by {/*,/*, r}, i.e., 
two exponentials in series times the probability that path is taken. 




ffO 

(«-(/* + A))(* ~ *) 


(8.5) 


VV 2 (s) is the “gain” after the loop and is simply an exponential times the probability 
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m (A) 


m ‘ (s+7) m ‘ (A + m) 




Figure 8.6: Transformation of Macroplace in Unreliable Machines (Probabilities and 
Rates of Transitions in Parentheses) 


Wi(s) 

CHD— 


XT 

W 2 (s) 


Figure 8.7: Loop Structure for Finding Moment Generating Function 
the path is taken. 


W 2 (s) = 

The overall transfer function is given by 

rw- 


s-(fi + X) 


W 2 (s) 


(8.6) 


(8.7) 


1 - W t (s) 

By substituting equation (8.5) and (8.6) into (8.7), the moment generating function 
is obtained: 

A (9 - s ) 


T(*) = 


( 8 . 8 ) 


— s(X -f* ft -h $) + X9 

By differentiating the T(s) with respect to s , it is now easy to determine the first 
two moments and obtain the mean and variance: 

\2 


E[x] =-= p— 

P 0 + fi 


E[x']= 2 ^ + {0 + rf 


6 2 X 2 


(8.9) 
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The coefficient of variation ( CV ) is a measure for the dispersity of the probability 
density function (pdf) (for the case of an exponential pdf CV = 1.0). For the model 
at hand, the CV is given by: 

jjgj - £[*)* /. . 2M 

1 \ £[*]’ V + 

Note that with this information it is possible to transform a transfer line with 
unreliable machines to a model with reliable machines by substituting the machining 
rate in the FLRMs by the pdf obtained from eq. (8.8) [DG91]. This corresponds to 
finding the flow equivalent from MP, to 6/,-. By doing so, we disregard the higher 
moments of the pdf, by substituting an exponential where CV = 1.0. 

The parameters of the three machine line are shown in Table 8.1. This short 
flow line is analytically tractable (the underlying CTMC has 798 tangible states). 
Rather than comparing our results to the simulation published in [CG87], we chose 
to compare our approximation results for the short line to the exact solution of the 
CTMC (which by and large confirms the simulation). 

The results of the approximation are shown in Table 8.2. Cut £* traced 
through P Mi and bl{ introduces the single level decomposition as shown in Figure 8.8 
(here only the cut, but not the aggregated net systems are shown). The first four 
columns show the throughput obtained with the exact solution of the underlying 
CTMC, together with the mean and coefficient of variation of the effective machin- 
ing time of machine 2. Columns 5 and 6 show the number of iterations as well as 
the error introduced by using RTA (remember that because of Property 5.4.2, all 
presented iterative methods yield the same result for K = 1). Columns 8 through 
11 show the results using a hierarchical decomposition approach as shown in Fig- 
ure 8.8. We apply the first cut (f x ) at buffer 1 which results in AS X , ASj an< ^ BS 1 . 
At the next level, we apply cut f 2 at buffer 2 in AS\ (now at level 1) to yield the 
aggregated net systems -4«S 21 , -4 «S 2 i an< ^ BS 2 . Figure 8.8b shows the resulting tree 
with the basic skeletons. 
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i 

Machine Parameters 

C 


$i m Xi pi CVi 


T 

0.05 0.03 0.5 0.3125 2.38 

8 

2 

0.06 0.04 

8 

3 

0.05 0.03 0.5 0.3125 2.38 



Table 8.1: Parameters for 3 Machine Transfer Line 

Column 12 shows the results obtained by using FEA at the macroplaces rep- 
resenting the machine failures and repairs. In the three machines, W t , m,, /*, Pj. 
and Ti are substituted by a single transition with the stand alone machining rate 
which is computed with equation 8.9. Finally, column 13 shows the results obtained 
using the decomposition approach by Choong. 

For the SISO cut £*» *4<Si has 59 tangible states (AS 2 has 21). In the hierar- 
chical decomposition, *4«S} and AS\ 2 have 19 while -4.5^ has 180 tangible states. 


8. 2.2.1 Interpretation of Results 

All approximation methods for the SISO cut at machine 2 yields a bad ap- 
proximation of the throughput. Three factors contribute: 

1. The number of tokens at that cut is low ( K = 1). 

2. In [ABS84], Aggrawal et al. observed that response time preservation is an 
effective technique for low to moderate coefficients of variation (CV < 5) and 
a large number of customers (K > 5). From Table 8.2 we can see that the 
error increases rapidly as A 2 (and therefore CV 2 ) increases. 

3. An additional error is introduced by cutting the fastest machine. 

The results in Table 8.2 derive from an MPMT-net. In order to examine 
the source of the rather large error, let us now analyze a similar marked graph 
system, i.e., let us analyze a transfer line without failures (a marked graph!) whose 
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- 0.1 

0.4 
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EB 
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.2454 

7 
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16 
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.2489 

8 
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19 
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.2497 
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B! 

1 

o 

- 

- 
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3.2 
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1.50 

4.6 

.2538 

10 
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- 

- 

16.8 

4.0 

3.0 

1.80 

5.0 

.2549 

11 

18.1 

mm 

Bl 

- 5.4 


- 

16.4 

4.4 


Table 8.2: Results for Three Machine Transfer Line 

machining rates are the stand alone machining rates p,- (equation 8.9) of the three 
machine, two buffer transfer line. Again, the SISO cut is performed at P\[ 7 and bl 2 . 
Table 8.3 shows the results (p 2 is the effective machining rate of machine 2, this is 
the value which was used in the marked graph system). 

From these results, we can draw two conclusions: 


• The aggregation in the marked graph system leads to a reasonable accuracy, 
i.e., the maximum error is only 5.1%. From a stochastic perspective, the 
difference between the system corresponding to Table 8.2 and Table 8.3 is a 
change in the coefficient of variation of the machining time. In the MPMT-net 
(Table 8.2), the coefficient of variation is larger than one, while in the marked 
graph system (Table 8.3), all coefficients of variation are equal to one. 
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Analytical 

| SISO Cut U 

a 2 

Pi 

^exact 


Err 

0.1 

0.06 

.0600 

.0600 

- 

0.2 

0.12 

.1200 

.1200 

- 

0.3 

0.18 

j .1794 

.1794 

- 

0.4 

0.24 

.2331 

.2327 

.1% 

0.5 

0.30 

.2689 

.2690 

- 

0.6 

0.36 

.2853 

.2887 

1.2% 

0.7 

0.42 

.2916 

.2986 

2.4% 

0.8 

0.48 

.2940 

.3038 

3.3% 

0.9 

0.54 

.2950 

.3067 

4.0% 

1.0 

0.60 

.2955 

.3084 

4.4% 

1.2 

0.72 

.2961 

.3101 

4.7% 

1.5 

0.90 

.2964 

.3111 

5.0% 

1.6 

0.96 

.2964 

.3112 

5.0% 

2.5 

1.50 

.2967 

.3118 

5.1% 

3.0 

1.80 

.2967 

.3119 

5.1% 


Table 8.3: Results for Three Machine Transfer Line Modeled with Reliable Machines 

Therefore, we conclude that the poor accuracy of the SISO cut through the 
MPMT-net is due to the large coefficients of variation of the equivalent ma- 
chining times. 

• From Table 8.3, it can be seen that as soon as A 2 increases beyond 0.52 (where 
the flow line is approximately balanced, i.e., p x = p 2 = Pz — 0.3125) the error 
increases rapidly. Therefore, we conclude that a cut at the fastest machine 
can lead to a larger error. 

The hierarchical decomposition using RTA also yields a substantial error. The 
problem with Marie’s method however, is that the number of required iterations is 
larger than RTA and that it fails to converge for large values of A 2 (A 2 > 1.6). The 
iteration simply cycles and does not converge. 

Aggregating the macroplaces yields a poor result (column 12 in Table 8.2). 
All we did here was to reduce the coefficient of variation to unity. As we reduced a 
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8 Machine Parameters 

7 Machine Parameters 
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0.05 

1.6 

1.4222 

1.34 

- 

- 

- 

- 

- 
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Table 8.4: Parameters for Long Flow Lines 

macroplace, the MCT of the subnet is independent of the interarrival process. Here 
we see clearly that the dependence of the MCT on the interarrival time is not the 
only factor which introduces an error in the aggregation, but that the coefficient of 
variation can have a strong impact on the throughput. 

For the SISO cut £4, FEA yields a better approximation than the presented 
iterative methods. This is coincidental and due to the interplay of the change of the 
coefficient of variation of the interface transition and the dependence of the MCT 
on the interarrival time of tokens. This relatively good results is specific to this 
example, as we will see in the next section. 


8.2.3 Experimental Results: Long Transfer Lines 

Table 8.4 shows the machine parameters for the 7 and 8 machine line. Both 
lines were analyzed by SISO cuts and hierarchical decomposition. Table 8.5 and 8.6 
show the numerical results of the approximation. Finally Table 8.7 shows a com- 
parison of the tangible state space for the SISO cuts. 

The hierarchical decomposition of the subnets (Figure 8.9) yields a balanced 
binary tree. For the ease of notation, the aggregated net system containing M, is 
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All Errors in %, DE = Delay Equivalence 
Throughput using Simulation: X = 0.1333 
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Table 8.5: Results for 7 Machine Flow Lines (Number of Iterations are Given at 
Top Level) 
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Table 8.6: Results for 8 Machine Flow Line (Number of Iterations are Given at Top 
Level) 


State Space of (ASi at SISO cuts 

7 Mach, M 4 

7 Mach, B 3 

8 Mach, B 4 

8 Mach, Ms 

ASi 

as 2 

ASt 

as 2 

AS t 

ASi 

ASi 

ASi 

2181 

843 

685 

8133 

9115 

9793 

32211 

1429 


ti 


Table 8.7: Tangible State Space for SISO Cuts at Long Flow Lines 
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denoted by AMj. Obviously, the individual aggregated net systems for the hier- 
archical decomposition of the long lines have the same structure as in the three 
machine case. 

For the SISO cuts at the 7 machine line the quality of the approximation using 
an iterative method (here K = 1) is quite good. The cut at _0 3 is superior to the 
one at M 4 , due to the larger number of tokens in the circuit that is cut. Note, that 
here the coefficient of variation of almost all machines is close to unity. 

For the SISO cuts at the 8 machine flow line, the coefficient of variations of 
the effective machining times are comparatively large which accounts for the error 
at the SISO cut at M 5 . The cut at f? 4 , is superior to the cut at M 5 , because now 
the number of customers is larger (C 4 = 2) than in the cut at M s . Again, Marie’s 
method yields the best SISO approximation while FEA is poor. In the 8 machine 
example we did not use delay equivalence because of the high computational cost of 
analyzing the underlying CTMC and the slow convergence of the method. 

The results of the hierarchical decomposition are acceptable. Nevertheless 
from just looking at the transfer line, it is not possible to infer which cuts (machines 
or buffer) and which method (RTA or Marie’s method) yields the best approxima- 
tion. 

From the above results it becomes clear that single cuts should be performed at 
the buffers , because the number of customers (tokens) is usually higher than at the 
machines. RTA often yields the fastest convergence, while Marie’s method yields 
the best results, although it sometimes fails to converge. 

8.3 Uninterpreted Dataflow Graph 

Figure 8.10 shows the structure of an uninterpreted dataflow as given in 
[KBB87]. The net is MPMT and free-choice. For a physical interpretation of the 
model, the marking has to be safe, nevertheless we analyze the net with the shown 




Original net system 



Level 0: cut at B\ 
Level 1: cut at B? 


Figure 8.8: 3 Machine, 2 Buffer Hierarchical Decomposition 



(Level 0) 

(Level 1) 
(Level 2) 


Cut 
Bi (Mi) 


Cut 
Bz (A/ 3 ) 


Cut afr* ^ Cut it 
Bs (A/s) Br (A/r) 

(8 Machine line only) 


Figure 8.9: Binary Tree for Aggregated Net Systems for Long Flow Lines 
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Figure 8.10: MPMT-net with the Structure of Uninterpreted Dataflow Graph 
(non-safe) initial marking. The interfiring times of the transitions are as follows: 

Si = 34 — S5 = S24 = $25 — 1.0 (8.11) 

Sg = 37 = 320 = 3 21 = •326 — 2.0 

•*11 = 3 12 = S13 = S 14 = S 15 = S 16 = 3.0 

S 17 = S 27 = 328 = 329 = 4.0 

The probability switches at the immediate transitions is given by the following 
probability rates: 


f 2 = 0.333333, <3 = 0.666667, t 8 = 0.857142, < x0 = 0.142858 (8.12) 
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t 18 = 0.25, fig = 0.75, i 2 2 = 0.50, t 23 — 0.50 


The vector of visit ratios can be computed according to equation (3.17), i.e., once 
we know the throughput of any transition, we can compute the throughput of all 
other transitions. 

The visit ratio normalized for the throughput of 7\ is given below: 


h (1.0000) 
T 4 (0.3333) 
T s (0.6667) 
T 6 (0.3333) 
T 7 (0.0556) 
T n (0.0556) 
r 12 (0.3333) 
T 13 (0.3333) 
r 14 (0.6667) 
Tis (0.2500) 
T 16 (0.7500) 
T 17 (1.0000) 
r 20 (0.2500) 
T 2 i (0.7500) 
r 24 (0.7500) 
Tn (0.7500) 
r 26 (0.7500) 
T 2 7 (0.7500) 
r 28 (1.5000) 
T 29 (0.7500) 


(8.13) 


In the first step we apply flow equivalent aggregation at the subnet shown in 


the dashed box in Figure 8.10. The subnet is a macroplace, i.e., here the mean 


ft 
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P17 



Figure 8.11: Uninterpreted Dataflow Graph with Flow Equivalent Transition 

completion time of a token does not depend on the interarrival process. Table 8.8 
shows the state dependent service rates of t fea in Figure 8.11. 

In the second step, the resulting net system shown in Figure 8.11 is further 
partitioned by £ to form SNi to the left of the cut and SN 2 to the right of the cut. 
By following the SISO Decomposition rule (Section 4.3), we arrive at the aggregated 
net systems in Figure 8.12. The nets are already shown with a solution to LPP3, 
here K = 3. 

Table 8.9 shows the results of the approximation using Marie’s method and 
RTA. For both methods, the introduced error is under 2.0%. Again, Marie’s method 
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k= 1 

k = 2 

conditional throughput 

0.168224 

0.280533 

tangible state space 

8 

36 


Table 8.8: State Dependent Service Rates for t fea in Dataflow Graph 


^exact(^i) “ 0,100859 

RTA 

Marie 

r 

Err 

r 

Err 

3 

-1.75% 

4 

-0.76% 


Table 8.9: Iteration Results for Uninterpreted Dataflow Graph (Error in %) 

shows improved accuracy at the expense of a higher number of iterations. The origi- 
nal net has 57015 states, while .45 1 has 261 states and AS 2 has 84 states. Therefore, 
the reduction of the state space is by over 2 orders of magnitude (57015/261 ss 264). 

8.4 Summary 

The results indicate that SISO cuts show good accuracy whenever the number 
of tokens in the cut circuit is high. Potentially bad results can occur if K = 1. 
For SISO cuts, in general Marie’s method shows improved accuracy over RTA at 
the expense of a higher number of iterations. Sometimes, Marie’s method fails to 
converge. Delay equivalence converges very slowly and does not show improved 
accuracy over RTA or Marie’s method. 

As can be seen from the examples, FEA can lead to potentially bad results 
because of two reasons: 

1. If the MOT depends on the interarrival process, the interaction between sub- 
systems is ignored. This error occurs when cutting transfer lines at the ma- 
chines. 



138 


2. The coefficient of variation of the service times of transitions strongly influ- 
ences the throughput (as could be seen at the 3 machine transfer line with 
reliable machines example). But even the iterative method could not have 
coped with that problem. For the particular case considered the results using 
FEA or RTA would have been the same (Property 5.4.2). 

Unfortunately, none of the presented methods is consistently better than the 
Choong and Gershwin method for the solution of transfer lines. Nevertheless, with 
the above discussion it should be clear that SISO cuts lead to reasonable accuracy 
if the number of tokens in the cut circuit is high. 

The hierarchical decomposition shows varying degrees of accuracy and at 
present it is not clear when it is advantageous to use SISO cuts or hierarchical 
decomposition. 



CHAPTER 9 

Flow Equivalent Aggregation for SIMO-cuts in MPMT-nets 

9.1 Introduction 

In Chapter 7 we have introduced SISO cuts for MPMT-nets. In this chapter 
we generalize SISO cuts for flow equivalent aggregation to SIMO cuts. The basic 
idea is to reduce a SIMO subnet to a state machine. We then formulate a reduced 
net called the flow equivalent net (FEN) [JD91a, JD91b]. The FEN replicates the 
input/output behavior of the SIMO subnet, as well as the expected delay a customer 
suffers. 

9.2 Well-Formed SIMO MPMT Cut 

Let us first formalize a well formed SIMO cut. Consider the net system with 
the cut £ as shown in Figure 9.1. 


SJVi 


0 


■0 


<i> 


4 

712 r 


*21 


*21 


0 


*5i 


SNo 


Figure 9.1: MPMT-net with SIMO Cut £ and Interface Places 


Definition 9.2.1 (Well-Formed SIMO MPMT Cut) Let SNx and SN 2 be the 
subnets generated by a SIMO cut £ to a MPMT-net, q \ 2 the single (input) interface 
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and Q 2 i = {? 2 i,di • ^ r = iGail the multi P le (output) interface. £ is 

well- formed iff: 

1. |?* 2 | = 1 and 'iq € Q? i, |?*| = 1* 

2. By the recursive reduction of macrotransitions and macroplaces, SN 2 can be 
reduced to a state machine. 

Definition 9.2.1 (1) states that once a token has entered an interface place, its 
flow is deterministic (the interface place ha only a single output transition), while 
(2) essentially states that we are not allowed to cut macrotransitions. Note that 
the interface of a well formed SIMO cut satisfies the Product Form Decomposition 
of Definition 2.3.1, i.e., once a token enters an interface, its routing is deterministic 
and we are not allowed to cut at concurrent processes (macrotransitions). 

SN 2 can be structurally reduced to a state machine. Therefore, for the FEN we 
can also choose any state machine representation, as long as there exists a directed 
path from the input interface qu to any of the output interface places in Q 2 1 , and 
we do not violate condition (1) of Definition 9.2.1. Of course, we are interested in 
a minimal representation in order to reduce the state space as much as possible. 
Figure 9.2 shows two realizations. 



Figure 9.2: Two Equivalent Minimal Realizations for a SIMO Flow Equivalent Net 

According to Theorem 3.5.1, both representations are equivalent. Note that 
the realization in Figure 9.2. a introduces additional vanishing markings because p 21 
is never marked for any nonzero period of time. The representation in Figure 9. 2. a 


t 
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directly shows the delay of the token (via the firing rate of r 2 ) and the routing 
probabilities (via the immediate transitions {f 21 • ••Qi})- As we are interested in a 
maximum reduction of the state space, we will use the realization shown in Figure 
9.2.b which does not introduce vanishing markings. 

Definition 9.2.2 (SIMO-MPMT Flow Equivalent Net) Let j\f = ( P , T, F ) 
be an MPMT-net and f a well formed SIMO-MPMT cut through the interface 
places qi 2 and Q 2X defining two subnets SNi = (P,-,Ti, Ff) with i = 1,2, where 
Q — {?i2 U Q21} IQ21I = r as shown in Figure 9.1. The structure of the FEN 
is given by: 

9*2 = {T 2 \T 2 2 ...T 2 r } = t 2 (9.1) 

r 2* = ? 21 > * = 1 . . . r 

as shown in Figure 9.2. 

The goal of the quantitative analysis is to find the state dependent firing rates of 
Tj, fi 2 (n), i = 1 . . . r, n = 1...K, where K is the maximum number of customers 
in SN 2 . 

By reducing SN 2 to the FEN, the original net system (.V, M 0 ) leads to the 
flow aggregated net system, FS = (Fj\f,M 0 ) as shown in Figure 9.3. Structurally 
speaking from M, SN 2 is reduced to the FEN (Definition 9.2.2). The structural 
reduction process is formalized by the following rule. 

Rule 3: SIMQ-FEA Reduction Rule 

1. Structure: FjV = (Pi, T, P), is a flow aggregated net, where: 

• P = P i 

• T = Ti U f 2 
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Figure 9.3: SNi with Substituted Flow Equivalent Net Forming FS 
• F = Fi U ({ 912 } x ^ 2 ) U(f 2 x Q 2 i); [=*■ ?i 2 =* f 2 , Q 21 = t 2 *] 

2. Marking: 

Let ATbea solution to the following linear programming problem: 

maximize ~M{q\ 2 ) 

subject to = Mo 4- C • a > 0 (LPP6) 

a>o, W> 0 


where C is the incidence matrix of the canonical MPMT-net obtained from 
the original net and cut. The initial marking for the flow aggregated net is 
the projection of A/ on Fff (Definition 3.3.2 in Section 3.3). 

The maxim um number of customers K in SN 2 is given by the marking bound at 
the single interface place K = B(q\ 2 ) = ^( 912 )- 

Property 9.2.1 (SIMO-MPMT Reduction Rule) Let (J\f,M 0 ) be a live 
MPMT-net. 

1. For the reduction from the original net system to the flow aggregated net sys- 
tem, the bounds of the places are preserved: 

B(p) = B(p), Vp € Pi 

2. Liveness in the flow aggregated net system is preserved. 
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Proof: It is a direct consequence of the reduction of the SIMO subnet by recursive 
application of the MP and MT rules, as well as Corollary 7.3.4. The transformation 
of the marking empties the least-loaded paths from the input interface to the output 
interface places in the subnet which is being reduced, therefore we maintain the 
marking bound of places. Thus, the liveness of the system is also preserved. ■ 

9.3 Quantitative Aspects 

During the FEA aggregation, we replace a subnet SN 2 by a flow equivalent net 
(Definition 9.2.2). In the quantitative analysis, we have to determine the conditional 
throughput through the output interface places of SN 2 as a function of customers 
n in the system, n = 1 ... K. The subnet is analyzed in short circuit by feeding 
the tokens which exit through the output interface back into the input interface via 
the immediate transitions <21 • • • *21 33 shown in Figure 9.4. This way we keep the 
number of customers in SN 2 constant. 




sn 2 






Figure 9.4: Feedback Construct for Flow Equivalent Net 

Rule 4: SIMO-MPMT Feedback Construct Let t 2 = . - .t 2 } as shown in 

Figure 9.4. The feedback construct is given by: 


1. Structure: A/pgjj = ( P 2 , T , F ), is a feedback construct, where: 


(a) P = P 2 

(b) f = r 2 u t 2 
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(c) F = Fi U ( Qn x f 2 ) U t 2 x {?n}); (=^ 9i 2 = *2 >@21 =* * 2 ) 

i.e., *t\ = q\ x , 4* = 912. * = 1 ■ • • r as shown in Figure 9.4 

2. Marking: 

Let ~M be a solution to the linear programming problem LPP6. The initial 
marking for the SIMO-MPMT Feedback Construct is the projection of ~M on 
jVpEN with the exception of g 12 (the marking of a single interface will be used 
to vary the number of customers in the system). 

We are now ready to formulate the algorithm for the quantitative analysis: 

Algorithm 3: Quantitative Analysis of Well-Formed SIMO-MPMT Cuts 

1. Select a well formed SIMO-MPMT cut £ 

2. Generate the flow aggregated net systems FS (Figure 9.3) by using the SIMO- 
FEA Reduction Rule and the feedback construct by using the SIMO-MPMT 
Feedback Construct Rule as shown in Figure 9.4. 

3. Analyze the feedback construct in Figure 9.4 for n = 1 ... K and obtain the 
conditional throughputs A”(n) of t\, i = 1 . . . r and n = 1 ... K. 

4. In FS, set the the firing rates n\{n) equal to the conditional throughputs 
obtained in step 3, i.e., 

p*(n) * X\n), i= l...r, n = 1...K (9.2) 

5. Analyze FS by solving the underlying CTMC and obtain the desired perfor- 


■d 


mance measures. 
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Figure 9.5: Example of MPMT-Net with Indicated Well-Formed SIMO-Cut. 

512 = -P35, Q 21 = {P20,P14,P15} 

9.4 Example for SIMO-MPMT Cut 

Figure 9.5 shows the example of an MPMT-net (but not FC) without a physical 
interpretation. The dashed box indicates the SIMO cut and the resulting SIMO 
subnet SN 2 . Let us first verify that £ is indeed well-formed by eliminating the 
macroplaces and macrotransitions until SN 2 is reduced to a state machine. By 
visually examining the net (or by using the algorithm outlined in [Sil81]), it is 
straight forward to check, that MT\ = (tin *12, *13} and MT 2 = {t 9 , £ 10 } form 
macrotransitions. After the reduction of the macroplace {p 2 , t 2 , p 2 , £3}, the set of 
transitions formed by MT 3 = {ti, f-t, is, t$, tj} also forms a macrotransition. Figure 
9.6 shows the structure of the resulting state machine. Note that the indicated cut 
is the only possible well-formed cut to the system. 

Let us now step through Algorithm 3: Quantitative Analysis of Well-Formed. 
SIMO-MPMT Cuts : 

1. The cut is already selected and well-formed. 

2 . Before generating the flow aggregated system IFS and the Feedback Construct, 
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MT\ 



MTi 


Figure 9.6: Transformed Subnet SiV 2 in State Machine Form (Macrotransitions are 
Shown Dashed) 


?2I U4 



Figure 9.7: Flow Aggregated System FS for MPMT-Net with SIMO-Cut 

(K = W(q 12 ) = 4) 

we have to solve LPP6. A possible solution is: 

iW(p 3 s) = 4, M{pz) = Af(j*s) = M(pio) = M{pxz) (9-3) 

= M{p\s) = M{p\ & ) = M{p2i) = M{py i ) = M{pj 3 ) (9.4) 

TS and the Feedback Construct with the computed initial marking from LPP6 
are shown in Figure 9.7 and 9.8 respectively. Now the number of customers 
^(712) = -M’O’cb) = 4 . 

3. We now vary the number of customers in the system from n — 1 ... 4. Table 9.1 
shows the conditional throughputs of f 2 obtained by varying the number of 
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Figure 9.8: Feedback Construct for MPMT-Net with SIMO-Cut (Number of Cus- 
tomers Shown: A/O 712 ) = 1) 

customers in the system, i.e., by varying M 0q 12 = 1 ... K, as well as the tan- 
gible state space (column 5). 
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Ill 


0.036307 

0.036105 

52 

IQ 

0.274374 

0.066655 

0.065204 

168 

IQ 

0.337118 

EEB 1 

0.085549 

376 

ID 

0.370327 

0.102093 

0.097113 

680 


Table 9.1: Conditional Throughputs using Flow Equivalent Aggregation for SIMO 
Cut at MPMT-net 


4. In this step we substitute the conditional throughputs obtained in step 3 back 
into J-S. 

5. Table 9.2 shows the results of the aggregation. 

By aggregating SiV 2 to a FEN, we have introduced very little error, while at the 
same time reducing the tangible state space by a factor of roughly 37. 
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mmm 


tang, state space 

FEA 

0.109755 

0.024093 

0.024274 

88720 

exact 

0.108199 

0.024497 

0.024191 

2424 


mam 


mSEmm 

— 


Table 9.2: Final Results Using Flow Equivalent Aggregation for SIMO Cut at 

MPMT-net 

9.5 Summary 

In this chapter, we have introduced a new class of cuts to MPMT-nets, where 
we allow the particular case of a multiple output interface. We require that the 
interface has the characteristics of a server in a network that allows Product Form 
Decomposition which was introduced in Definition 2.3.1. 

We have generalized the SISO interface for FEA to flow equivalent nets which 
capture the input/output behavior of the SIMO subnet. For the shown example, the 
accuracy of the method is good with a large reduction in the state space. However, 
it has to be kept in min d, that good accuracy can by no means be guaranteed 
especially if there exists strong dependence of the MCT on the interarrival times or 
large coefficient of variation in the service time of transitions. 













CHAPTER 10 

Case Study of a Dual Arm Robotics System for Assembly 
10.1 Introduction 

In this chapter we present an application from the area of robotics as applied 
to space activities. The demo consists of the insertion of a strut into a structure (i.e., 
two previously assembled struts). We will first describe the physical and hardware 
configuration of the system. Then we will describe (in words) the task at hand and 
present the Petri net model. We then apply flow equivalent aggregation which was 
presented in the previous chapters. 

The following description of the hardware and software components at the 
Center for Intelligent Robotics for Space Exploration (CIRSSE) is adapted from 
[LS91]. 

The Theory of Intelligent Machines , as proposed in [Sar79], describes a hier- 
archical organization of the functions of an autonomous robot into three levels: on 
the lowest level is the Execution Level containing the hardware and basic control 
functions, further up in the hierarchy is the Coordination Level integrating the ca- 
pabilities of the intelligent machine across hardware systems, and an Organization 
Level providing higher- level planning and reasoning capabilities. The nature of the 
intelligent machine levels axe different: The Execution Level provides the control 
functions, the Coordination Level is characterized by scheduling and operations re- 
search issues, and the Organization Level employs artificial intelligence techniques 
for task planning. A diagram of the intelligent machine is shown in Figure 10.1. The 
model is organized according to the principle of Increasing Precision With Decreas- 
ing Intelligence [SV77, SV88]. Detailed run-time information is maintained at the 
lowest Execution Level and only relevant status information is propagated upward 
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through the hierarchy. 



Figure 10.1: Logical Model of Intelligent Machine 


In the case study, the entire strut assembly operation is started by a single 
assemble_strut command from the Organization Level to the Coordination Level 
Dispatcher. Therefore, operation of the Organization Level is omitted from this case 
study and only the execution and coordination level are considered. 

10.2 Hardware and Software Configuration 

The testbed hardware at CIRSSE consists of an 18-DOF manipulator system, 
a 5 camera vision subsystem, their associated control computers, and several UNIX 
workstations. The 18-DOF manipulator is built from two 6-DOF PUMA robots (a 
PUMA 560 and a PUMA 600) mounted on a custom-built robot transporter with 
two 3-DOF carts. The carts travel along a 12 foot linear track and provide ±45° of 
tilt and ±150° of rotation. This manipulator subsystem is controlled by a Motion 
Control System. For the experiment at hand, only one of the manipulators was used. 


« 
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The vision subsystem has five cameras that may be used individually or pair- 
wise, a laser scanner for pointing light, and a (VME-based) Datacube image pro- 
cessing system. Additional processing capabilities are provided by four UNIX work- 
stations (predominantly Sun 4 and Sparc Stations) attached to an Ethernet which 
connects all of the testbed subsystems. 

As shown in Figure 10.1, the Coordination Level of the intelligent machine is 
organized into a tree structure, with a Dispatcher at the root and multiple Coordi- 
nators at the leaves. 

The Dispatcher is the first component of the intelligent machine that deals with 
the machine as a >vhole. It coordinates actions across Execution Level functions, 
e.g., move a manipulator to a position determined by visual sensing. The Dispatcher 
defines the executions of the subtasks. 

Petri nets are used to implement the event-driven environment of the dis- 
patcher and the coordinators [WS90]. In this case study, we concentrate on the 
level of the hierarchy represented by the dispatcher. 

10.3 Case Study 

As a case study, the dispatcher realized the insertion of a strut into an existing 
V-node. A schematic diagram of the experimental setup is shown in Figure 10.2. 
In addition to the two cameras mounted on the wrist of the robot, two additional 
cameras at the ceiling are used, which is not shown in the figure. 

The task is as follows: After the initialization of the robot and the vision sys- 
tem, the cameras mounted on the ceiling find the preassembled V-struts (essentially 
two struts from a previous assembly operation). The robot then moves to the end 
nodes of the V-strut lying on the table and determines the exact location of the 
insertion point with the wrist cameras. The robot then proceeds to move to the 
rack and picks up the strut to be inserted. After the pickup operation, the robot 
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moves back to the table and inserts the strut into the V-strut to form a triangle. 

Figure 10.3 shows a high level representation of the dispatcher. The four 
Macrotransitions represent the following activities: 

MT X : Initialize the robot and vision system and localize the V-strut (i.e., strut A 
and B which have previously been assembled) with the ceiling camera. 

MTi : Move to the nodes of the V-strut and use the wrist camera to obtain the exact 
position. 

MT Z : Move to the rack, localize both ends of strut C with the wrist camera in order 
to obtain its exact position. 

MT+: Pick up strut C and insert it into the V-strut. Shut down the system. 

As can be easily checked, the net in Figure 10.3 is not strongly connected. We 
obtain a strongly connected net (an MPMT-net) by merging the places req.demo 
and demo-complete. The net is now repetitive and the performance analysis can be 
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node^pose 


robot 


V-found 


kd-#-ho-|; 


sPP-rdyf 

req.demo / MT\ M MTz 



Figure 10.3: High Level Representation of Dispatcher 


done. The obtained cycle time is the required time from the start to the finish of 
the activities. Figure 10.4 shows a detailed representation of the aggregated transi- 
tions. Note that all shown subnets with the exception of MT\ are macrotransitions. 
In MTi, place PPjready is a choice place. Transitions homed and notjxome have 
enabling functions attached to them, i.e., they will fire depending on the position of 
the robot. Here we assume, that the robot is already homed before the demonstra- 
tion. Therefore, the resulting model is a MG. The complete net can be obtained by 
merging MT\ through MT 4 at the places which bear the same label. As shown in 
the figure, only places camera and req.demo are marked. 

Table 10.1 gives a listing of all transitions, together with a brief description of 
the associated activity and the required time to completion (i.e., the mean interfiring 
time). The values were obtained experimentally, by stepping the testbed through 
the assembly operations and measuring the times. 

The marking of the dispatcher is safe (i.e., 1-bounded) and no internal circuits 
are present. We therefore choose flow equivalent aggregation for the approximate 
analysis of the system. The dispatcher is partitioned into four subsystems MTi , i = 
1...4. All cuts to the system are SISO. After the canonical transformation, a 
solution to the linear programming problem LPP3 shows, that each of the subnets 
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should be analyzed with a single token. 

Table 10.2 shows the results of the analysis. The original net has 62 tangible 
states. The mean time to complete the demo was measured to be 207 seconds, while 
the mean time as obtained by solving the underlying CTMC of the Petri net system 
is 205.7 seconds. This is also the results of the flow equivalent aggregation therefore, 
in this particular case the aggregation is exact. 

10.4 Summary 

From this case study, we can draw two conclusions: 

• The agreement of the measured value of the completion time of the demo with 
the value obtained by solving the CTMC is good. This substantiate the fact, 
that we can use the stochastic Petri net model for the performance analysis. 

• In certain cases, FEA can lead to exact results. This comes as no surprise. In 
the example in Section 5.2.3 we have already encountered a case where the 
aggregation was exact. In that case, no internal loops or trapped tokens were 
present, furthermore the marking was safe. This is exactly the same case here. 
In this particular case, the MCT is independent of the interarrival process. 
Furthermore, in cyclic queues with exponential servers and a single customer, 
the cycle time is just the sum of the service times of the servers. 

The analyzed case study is the simple task of inserting a strut into a previ- 
ously existing structure. In the future, the testbed system will perform far more 
complicated tasks. Therefore, the dispatcher will be more complex and its associ- 
ated state space will grow large. The performance analysis will become increasingly 
difficult, because of the large state space. By using the presented methodology, we 
can predict the performance (i.e., the time required to execute a task) before the 
system is actually implemented. 
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cameras CAM_avail init_CAM CAM_ready find_V 





Figure 10.4: Exploded Composite Transitions MTi, MT 2 , MT 3 and MT 4 of CIRSSE 
Testbed 
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Interfiring times in seconds 

Transition 

time 

Description 

start 

0.0 

start the demo (immediate) 

init-CAM 

0.8 

initialize cameras (wrist and ceiling) 

init-ROB 

11.2 

initialize robot 

init-PP 

1.0 

initialize path planner 

moveJiome 

10.0 

move robot to home position 

find.V 

10.4 

find V-strut with ceiling camera 

calc.apprl 

0.8 

compute approach to V-strut (node 1) 

PP-nodel 

2.0 

plan path to node 1 

movejnodtl 

13.3 

move robot to node 1 

calc.apprS 

1.3 

compute approach to V-strut (node 2) 

refine-node 1 

5.8 

locate node 1 with wrist camera and refine location 

PP.node2 


plan path to node 2 

move.nodeS 

10.3 

move robot to node 2 

refinejnode2 

5.8 

locate node 2 with wrist camera and refine location 

rackjrdy 

0.0 

Ready to start activities at rack (immediate) 

PP-rackL 

ma 

plan path to left node of strut on the rack 

movtjrackL 

20.0 

move robot to left node of strut on the rack 

refinejrackL 

6.6 

locate left node of strut on the rack with wrist camera 

PPjrackR 


plan path to right node of strut on the rack 

movtjrackR 

10.0 I 

move robot to right node of strut on the rack 

refine-rackR 


locate right node of strut on the rack with wrist camera 

calc-apprR 

ma 

calculate approach for pickup of strut 

PP.rack 

ma 

plan path for pickup of strut 


Ma 

move robot to rack 

rack-done 


activities at rack done (immediate) 

pickupstrut 

9.0 

pickup strut from rack 

calc-apprl 

■SI 

calculate approach for insert of strut into V-strut 

PPJnsert 


plan path for insert of strut 

move-insert 

20.8 

move robot to insertion point 

insert-strut 

34.0 

insert strut into V-strut 

PP .final 

2.0 

plan path to final position of robot 

move-final 

10.7 

move robot to final position 

$hut-CAM 

0.5 

shut down vision system 

shut-ROB 

13.4 

shut down robot 

finish 

0.0 

everything finished 


Table 10.1: Description and Interfering Times of Transitions in Dispatcher (Exper- 
imental Data) 
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Time unit in seconds 

aggr. net 

X 

r 

tang, state space 

MT X 

0.044192 

22.6283 

8 

MT-i 

0.026242 

38.1068 

12 

MT Z 

0.017819 

56.1199 

11 

mt 4 

0.011253 

88.8652 

31 


Table 10.2: Results of the Flow Equivalent Aggregation of the Subnets Composing 
the CIRSSE Testbed 




CHAPTER 11 
Conclusions 


11.1 Discussion 

In the present work, we have introduced the structural decomposition of 
Marked Graphs (MGs) and Macroplace/Macrotransition nets (MPMT-nets), a sub- 
class of Petri nets which allows limited choice and synchronizations. Single cuts lead 
to aggregated net systems, where one of the subnets is reduced to a single transition 
and the basic skeleton, where both subnets are reduced to a transition. 

The cuts can be categorized according to their interface places, i.e., places 
which both subnets have in common. Single Input/Single Output (SISO) cuts do 
not introduce any synchronization constraints or marking bound changes and are 
therefore preferred. 

In MGs, Single Input/Multiple Output (SIMO/MISO) and Multiple In- 
put/Multiple Output (MIMO) cuts introduce additional synchronization con- 
straints, and can lead to a different net behavior. In some cases, the synchronization 
of concurrent processes can lead to bad results. 

The introduced cuts (with the exception of the SIMO cut in MPMT-nets) 
lead to a product form representation of the basic skeleton, i.e., it can be solved by 
analytic means (e.g., Mean Value Analysis). 

Once the net is decomposed, a variety of methods can be applied for the 

approximate analysis. 

• Response Time Approximation: [DJS92, JSS92b] 

This method is introduced in the present work. From the analysis of the 
aggregated net system, RTA only uses the throughput of the analysis of the 
aggregated net system. Other information such as the token distribution in 
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the subnet is not used. RTA is very robust, converges quickly and shows good 
accuracy. 

• Marie’s Method: [Mar79] 

In Marie’s method, we use information about the token distribution in the 
aggregated net systems. In all considered SISO cuts, the additional informa- 
tion leads to an improved accuracy over RTA, although convergence is usually 
slower. In one particular case, Marie’s method failed to converge. 

• Flow Equivalent Aggregation (FEA): [JD91a, LZGS84] 

FEA is computationally very efficient, but has a severe drawback. If the mean 
completion time of a subnet depends strongly on the interarrival process, the 
aggregation can lead to very bad results as seen in Section 8.2.2. FEA should 
be used with caution, and the subnets should be tested whether a strong 
dependence of the mean completion time on the arrival process exists (as 
outlined in Section 5.2.3). 

• Delay Equivalence (DE): [LW91, WL91] 

Delay equivalence was introduced in the context of Petri nets by Woodside et 
al. State dependent DE shows very poor convergence. In the presented exam- 
ples, this method failed in most cases. State independent DE converges more 
often, but still did not reach a fixed point solution in all cases. Furthermore, 
the number of iterations is high compared to RTA or Marie’s method. When 
DE converges, the accuracy is similar to RTA. In DE, information about the 
throughput and the token distribution in both subnets is used. 

Based on our experimental results, we can now give guidelines for using the 
different methods: 

• Delay Equivalence shows no advantage over any of the other methods and 


should not be used. 
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• Flow Equivalent Aggregation should only be used when no internal cycles are 
present, or it has been verified that the mean completion time of the subnet 
does not depend on the arrival process (Section 5.2.3). In any case, FEA can 
yield good initial values for Marie’s method and RTA. 

• Marie’s Method is the method of choice when single SISO cuts are considered. 
Although convergence is often slower than in RTA, the accuracy was better in 
all considered single SISO cuts. 

• Response Time Approximation should be used when FEA is not applicable, 
computation time is at a premium or Marie’s does not converge. This repre- 
sents the classical cost versus quality tradeoff. 

Unfortunately, for the hierarchical decomposition we are not able to give such 
guidelines as no scheme is consistently better. 

In MPMT-nets, FEA was extended to SIMO cuts. This allowed the formu- 
lation of a flow equivalent server which captures the input/ output behavior of the 
subnet. 

As a real life case study, a dual arm robotics testbed for assembly operations 
was analyzed. The results show, that the proposed aggregation is suitable in that 
context. 

11.2 Outlook 

Based on the present work, we can identify several research directions: 

1. If the customer load at the cut is low (e.g., I< = 1), the presented methods 
can lead to bad results. One approach could be to capture the higher mo- 
ments of the probability distribution of the residence time of the tokens in the 
interface places. This way, we would be able to use more information about 
the aggregated net systems, which could lead to a better approximation. 
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2. At present, the choice of the cuts is limited. By devising different basic skele- 
tons, which take the multiple nature of the interface into account, we could 
allow more general types of cuts. 

3. The almost product form decomposition in Petri nets can be further explored. 
A first step in that direction was done in Chapter 9, where SIMO cuts in 
MPMT-nets were introduced. 
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